GENE EXPRESSION PROFILING IN BIOPSIED TUMOR TISSUES

Background of the Invention 

Cross-Reference

This application claims the benefit under 35 U.S.C. 119(e) of provisional applications
serial nos. 60/412,049, filed September 18, 2002 and 60/364,890, filed March 13, 2002, the
entire disclosures of which are hereby incorporated by reference.

Field of the Invention 

The present invention relates to gene expression profiling in biopsied tumor tissues. In
particular, the present invention concerns sensitive methods to measure mRNA levels in biopsied
tumor tissues, including archived paraffin-embedded biopsy material. In addition, the invention
provides a set of genes the expression of which is important in the diagnosis and treatment of
breast cancer.

Oncologists have a number of treatment options available to them, including different
combinations of chemotherapeutic drugs that are characterized as "standard of care," and a
number of drugs that do not carry a label claim for a particular cancer, but for which there is
evidence of efficacy in that cancer. The best likelihood of a good treatment outcome requires that
patients be assigned to the optimal available cancer treatment, and that this assignment be made as
quickly as possible following diagnosis.

Currently, diagnostic tests used in clinical practice are single analyte, and therefore do not 
capture the potential value of knowing relationships between dozens of different markers. 
Moreover, diagnostic tests are frequently not quantitative, relying on immunohistochemistry. 
This method often yields different results in different laboratories, in part because the reagents
are not standardized, and in part because the interpretations are subjective and cannot be easily
quantified. RNA-based tests have not often been used because of the problem of RNA
degradation over time and the fact that it is difficult to obtain fresh tissue samples from patients
for analysis. Fixed paraffin-embedded tissue is more readily available and methods have been
established to detect RNA in fixed tissue. However, these methods typically do not allow for the
study of large numbers of genes (DNA or RNA) from small amounts of material. Thus,
traditionally fixed tissue has been rarely used other than for immunohistochemistry detection of 
proteins. 

Recently, several groups have published studies concerning the classification of various
cancer types by microarray gene expression analysis (see, e.g. Golub et al., Science 286:531-537
(1999); Bhattacharjee et al., Proc. Natl. Acad. Sci. USA 98:13790-13795 (2001); Chen-Hsiang et
al., Bioinformatics 17 (Suppl. 1):S316-S322 (2001); Ramaswamy et al., Proc. Natl. Acad. Sci.
USA 98:15149-15154 (2001)). Certain classifications of human breast cancers based on gene
expression patterns have also been reported (Martin et al., Cancer Res. 60:2232-2238 (2000);
West et al., Proc. Natl. Acad. Sci. USA 98:11462-11467 (2001); Sorlie et al., Proc. Natl. Acad.
Sci. USA 98:10869-10874 (2001); Yan et al., Cancer Res. 61:8375-8380 (2001)). However,
these studies mostly focus on improving and refining the already established classification of
various types of cancer, including breast cancer, and generally do not provide new insights into
the relationships of the differentially expressed genes, and do not link the findings to treatment
strategies in order to improve the clinical outcome of cancer therapy.

Although modern molecular biology and biochemistry have revealed more than 100 genes
whose activities influence the behavior of tumor cells, the state of their differentiation, and their
sensitivity or resistance to certain therapeutic drugs, with a few exceptions, the status of these
genes has not been exploited for the purpose of routinely making clinical decisions about drug
treatments. One notable exception is the use of estrogen receptor (ER) protein expression in
breast carcinomas to select patients for treatment with anti-estrogen drugs, such as tamoxifen.
Another exceptional example is the use of ErbB2 (Her2) protein expression in breast carcinomas
to select patients for treatment with the Her2 antagonist drug Herceptin® (Genentech, Inc.,
South San Francisco, CA).

Despite recent advances, the challenge of cancer treatment remains to target specific
treatment regimens to pathogenically distinct tumor types, and ultimately personalize tumor
treatment in order to maximize outcome. Hence, a need exists for tests that simultaneously
provide predictive information about patient responses to the variety of treatment options. This
is particularly true for breast cancer, the biology of which is poorly understood. It is clear that
the classification of breast cancer into a few subgroups, such as the ErbB2+ subgroup, and subgroups
characterized by low to absent gene expression of the estrogen receptor (ER) and a few additional
transcriptional factors (Perou et al., Nature 406:747-752 (2000)) does not reflect the cellular and
molecular heterogeneity of breast cancer, and does not allow the design of treatment strategies
maximizing patient response.






Summary of the Invention 
The present invention provides (1) sensitive methods to measure mRNA levels in 
biopsied tumor tissue, (2) a set of approximately 190 genes, the expression of which is important 
in the diagnosis of breast cancer, and (3) the significance of abnormally low or high expression
for the genes identified and included in the gene set, through activation or disruption of
biochemical regulatory pathways that influence patient response to particular drugs used or
potentially useful in the treatment of breast cancer. These results permit assessment of genomic
evidence of the efficacy of more than a dozen relevant drugs.
The present invention accommodates the use of archived paraffin-embedded biopsy
material for assay of all markers in the set, and therefore is compatible with the most widely
available type of biopsy material. The invention presents an efficient method for extraction of
RNA from wax-embedded, fixed tissues, which reduces the cost of the mass production process for
acquisition of this information without sacrificing quality of the analysis. In addition, the
invention describes a novel highly effective method for amplifying mRNA copy number, which
permits increased assay sensitivity and the ability to monitor expression of large numbers of
different genes given the limited amounts of biopsy material. The invention also captures the
predictive significance of relationships between expressions of certain markers in the breast
cancer marker set. Finally, for each member of the gene set, the invention specifies the
oligonucleotide sequences to be used in the test.

In one aspect, the invention concerns a method for predicting clinical outcome for a 
patient diagnosed with cancer, comprising 

determining the expression level of one or more genes, or their expression products,
selected from the group consisting of p53BP2, cathepsin B, cathepsin L, Ki67/MiB1, and
thymidine kinase in a cancer tissue obtained from the patient, normalized against a control gene
or genes, and compared to the amount found in a reference cancer tissue set,

wherein a poor outcome is predicted if:

(a) the expression level of p53BP2 is in the lower 10th percentile; or

(b) the expression level of either cathepsin B or cathepsin L is in the upper 10th
percentile; or

(c) the expression level of either Ki67/MiB1 or thymidine kinase is in the upper
10th percentile.
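The percentile rule above can be sketched in a few lines of Python. This is only an illustrative sketch, not the claimed method: the function names, the dictionary layout, and the reading of "lower 10th percentile" as a percentile rank of at most 10 (and "upper 10th percentile" as a rank above 90) within the reference set are assumptions made for the example.

```python
from bisect import bisect_right


def percentile_rank(value, reference_values):
    """Percent of reference values that are <= value (0 to 100)."""
    ordered = sorted(reference_values)
    return 100.0 * bisect_right(ordered, value) / len(ordered)


def predicts_poor_outcome(sample, reference_set, cutoff=10.0):
    """Apply conditions (a)-(c) to normalized expression levels.

    sample:        gene name -> normalized level in the patient tissue
    reference_set: gene name -> list of normalized levels in reference tissues
    Both layouts are hypothetical; genes missing from `sample` are skipped.
    """
    rank = {g: percentile_rank(v, reference_set[g]) for g, v in sample.items()}

    def low(gene):   # assumed meaning of "lower 10th percentile"
        return gene in rank and rank[gene] <= cutoff

    def high(gene):  # assumed meaning of "upper 10th percentile"
        return gene in rank and rank[gene] > 100.0 - cutoff

    return (
        low("p53BP2")                                       # condition (a)
        or high("cathepsin B") or high("cathepsin L")       # condition (b)
        or high("Ki67/MiB1") or high("thymidine kinase")    # condition (c)
    )
```

Because the method calls for "one or more" of the genes, the sketch evaluates only the genes actually present in the sample.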

Poor clinical outcome can be measured, for example, in terms of shortened survival or 
increased risk of cancer recurrence, e.g. following surgical removal of the cancer. 
In another embodiment, the invention concerns a method of predicting the likelihood of
the recurrence of cancer, following treatment, in a cancer patient, comprising determining the
expression level of p27, or its expression product, in a cancer tissue obtained from the patient,
normalized against a control gene or genes, and compared to the amount found in a reference
cancer tissue set, wherein an expression level in the upper 10th percentile indicates decreased
risk of recurrence following treatment. 

In another aspect, the invention concerns a method for classifying cancer, comprising
determining the expression level of two or more genes selected from the group consisting of 
Bcl2, hepatocyte nuclear factor 3, ER, ErbB2, and Grb7, or their expression products, in a cancer 
tissue, normalized against a control gene or genes, and compared to the amount found in a 
reference cancer tissue set, wherein (i) tumors expressing at least one of Bcl2, hepatocyte 
nuclear factor 3, and ER, or their expression products, above the mean expression level in the 
reference tissue set are classified as having a good prognosis for disease free and overall patient 
survival following treatment; and (ii) tumors expressing elevated levels of ErbB2 and Grb7, or 
their expression products, at levels ten-fold or more above the mean expression level in the 
reference tissue set are classified as having poor prognosis of disease free and overall patient 
survival following treatment.
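As a rough illustration of classification rules (i) and (ii), the following Python sketch compares each marker with the mean of the reference tissue set. It assumes linear-scale, positive normalized expression values and a hypothetical dictionary layout. The text does not say how a tumor meeting both rules should be handled, so the sketch simply returns every label that applies.

```python
from statistics import mean

GOOD_MARKERS = ("Bcl2", "hepatocyte nuclear factor 3", "ER")
POOR_MARKERS = ("ErbB2", "Grb7")


def classify_tumor(sample, reference_set, fold=10.0):
    """Return the prognosis labels whose rule is satisfied.

    sample:        gene name -> normalized level (linear scale, assumed)
    reference_set: gene name -> list of normalized levels in reference tissues
    """
    ref_mean = {gene: mean(levels) for gene, levels in reference_set.items()}
    labels = []
    # Rule (i): at least one good-prognosis marker above the reference mean.
    if any(g in sample and sample[g] > ref_mean[g] for g in GOOD_MARKERS):
        labels.append("good prognosis")
    # Rule (ii): ErbB2 and Grb7 both at least ten-fold above the reference mean.
    if all(g in sample and sample[g] >= fold * ref_mean[g] for g in POOR_MARKERS):
        labels.append("poor prognosis")
    return labels
```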

All types of cancer are included, such as, for example, breast cancer, colon cancer, lung 
cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, 
ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal 
cancer, carcinoma, melanoma, and brain cancer. The foregoing methods are particularly suitable 
for prognosis/classification of breast cancer. 

In all previous aspects, in a specific embodiment, the expression level is determined using 
RNA obtained from a formalin-fixed, paraffin-embedded tissue sample. While all techniques of 
gene expression profiling, as well as proteomics techniques, are suitable for use in performing 
the foregoing aspects of the invention, the gene expression levels are often determined by reverse 
transcription polymerase chain reaction (RT-PCR). 

If the source of the tissue is a formalin-fixed, paraffin embedded tissue sample, the RNA 
is often fragmented. 

The expression data can be further subjected to multivariate analysis, for example using 
the Cox Proportional Hazards model. 
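For orientation, the Cox Proportional Hazards model in its standard textbook form relates the hazard of an event (for example recurrence or death) at time t to a set of covariates. Here the covariates x_1, ..., x_p are taken to be the normalized expression levels of p genes; that choice of covariates is an assumption made for illustration and is not specified in the text.

```latex
h(t \mid x_1, \dots, x_p) = h_0(t)\, \exp\!\left( \sum_{i=1}^{p} \beta_i x_i \right)
```

In this form h_0(t) is the baseline hazard, each coefficient \beta_i is estimated from the reference data, and \exp(\beta_i) is the hazard ratio associated with a one-unit increase in the expression level of gene i when the other covariates are held fixed.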

In a further aspect, the invention concerns a method for the preparation of nucleic acid 
from a fixed, wax-embedded tissue specimen, comprising: 

(a) incubating a section of the fixed, wax-embedded tissue specimen at a temperature
of about 56 °C to 70 °C in a lysis buffer, in the presence of a protease, without prior dewaxing, to
form a lysis solution;

(b) cooling the lysis solution to a temperature where the wax solidifies; and 

(c) isolating the nucleic acid from the lysis solution. 
The lysis buffer may comprise urea, such as 4 M urea.

In a particular embodiment, incubation in step (a) of the foregoing method is performed at
about 65 °C.

In another particular embodiment, the protease used in the foregoing method is proteinase
K.

In another embodiment, the cooling in step (b) is performed at room temperature. 

In a further embodiment, the nucleic acid is isolated after protein removal with 2.5 M
NH4OAc.

The nucleic acid can, for example, be total nucleic acid present in the fixed,
wax-embedded tissue specimen.

In yet another embodiment, the total nucleic acid is isolated by precipitation from the
lysis solution, following protein removal, with 2.5 M NH4OAc. The precipitation may, for
example, be performed with isopropanol.

The method described above may further comprise the step of removing DNA from the
total nucleic acid, for example by DNase treatment.

The tissue specimen may, for example, be obtained from a tumor, and the RNA may be
obtained from a microdissected portion of the tissue specimen enriched for tumor cells.

All types of tumor are included, such as, without limitation, breast cancer, colon cancer,
lung cancer, prostate cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical
cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer,
renal cancer, carcinoma, melanoma, and brain cancer, in particular breast cancer.

The method described above may further comprise the step of subjecting the RNA to
gene expression profiling. Thus, the gene expression profile may be completed for a set of genes
comprising at least two of the genes listed in Table 1.

Although all methods of gene expression profiling are contemplated, in a particular
embodiment, gene expression profiling is performed by RT-PCR which may be preceded by an
amplification step.

In another aspect, the invention concerns a method for preparing fragmented RNA for 
gene expression analysis, comprising the steps of: 
(a) mixing the RNA with at least one gene-specific, single-stranded DNA scaffold
under conditions such that fragments of the RNA complementary to the DNA scaffold hybridize
with the DNA scaffold;

(b) extending the hybridized RNA fragments with a DNA polymerase to form a
DNA-DNA duplex; and

(c) removing the DNA scaffold from the duplex.

In a specific embodiment, in step (a) of this method, the RNA may be mixed with a
mixture of single-stranded DNA templates specific for each gene of interest.

The method can further comprise the step of heat-denaturing and reannealing the
duplexed DNA to the DNA scaffold, with or without additional overlapping scaffolds, and
further extending the duplexed sense strand with DNA polymerase prior to removal of the
scaffold in step (c).

The DNA templates may be, but do not need to be, fully complementary to the gene of 
interest. 

In a particular embodiment, at least one of the DNA templates is complementary to a
specific segment of the gene of interest.

In another embodiment, the DNA templates include sequences complementary to 
polymorphic variants of the same gene. 

The DNA template may include one or more dUTP or rNTP sites. In this case, in step (c)
the DNA template may be removed by fragmenting the DNA template present in the DNA-DNA
duplex formed in step (b) at the dUTP or rNTP sites.

In an important embodiment, the RNA is extracted from fixed, wax-embedded tissue 
specimens, and purified sufficiently to act as a substrate in an enzyme assay. The RNA 
purification may, but does not need to, include an oligo-dT based step. 

In a further aspect, the invention concerns a method for amplifying RNA fragments in a
sample comprising fragmented RNA representing at least one gene of interest, comprising the
steps of:

(a) contacting the sample with a pool of single-stranded DNA scaffolds comprising
an RNA polymerase promoter at the 5' end under conditions such that the RNA fragments
complementary to the DNA scaffolds hybridize with the DNA scaffolds;

(b) extending the hybridized RNA fragments with a DNA polymerase along the DNA 
scaffolds to form DNA-DNA duplexes; 

(c) amplifying the gene or genes of interest by in vitro transcription; and 

(d) removing the DNA scaffolds from the duplexes. 

An exemplary promoter is the T7 RNA polymerase promoter, while an exemplary DNA
polymerase is DNA polymerase I.

In step (d) the DNA scaffolds may be removed, for example, by treatment with DNase I.

In a further embodiment, the pool of single-stranded DNA scaffolds comprises partial or
complete gene sequences of interest, such as a library of cDNA clones.

In a specific embodiment, the sample represents a whole genome or a fraction thereof. In
a preferred embodiment, the genome is the human genome.

In another aspect, the invention concerns a method of preparing a personalized genomics 
profile for a patient, comprising the steps of: 

(a) subjecting RNA extracted from a tissue obtained from the patient to gene
expression analysis;

(b) determining the expression level in such tissue of at least two genes selected from
the gene set listed in Table 1, wherein the expression level is normalized against a control gene
or genes, and is compared to the amount found in a cancer tissue reference set; and

(c) creating a report summarizing the data obtained by the gene expression
analysis.
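Steps (b) and (c) can be pictured with a small Python sketch. The text does not specify the normalization arithmetic, so the sketch assumes the simplest option: each target gene's raw level is divided by the mean raw level of the control genes, and the result is reported alongside its ratio to the mean of the reference set. All function names, gene labels, and data layouts are hypothetical.

```python
from statistics import mean


def normalize(raw, control_genes):
    """Divide each target gene's raw level by the mean level of the control genes."""
    baseline = mean(raw[g] for g in control_genes)
    return {g: v / baseline for g, v in raw.items() if g not in control_genes}


def build_report(raw, reference_set, control_genes):
    """Return one summary line per target gene, sorted by gene name.

    raw:           gene name -> raw expression level in the patient tissue
    reference_set: gene name -> list of normalized levels in reference tissues
    """
    normalized = normalize(raw, control_genes)
    lines = []
    for gene in sorted(normalized):
        ratio = normalized[gene] / mean(reference_set[gene])
        lines.append(
            f"{gene}: normalized={normalized[gene]:.2f}, "
            f"ratio_to_reference_mean={ratio:.2f}"
        )
    return lines
```

A real report would cover at least two genes from Table 1 and could use percentile ranks rather than ratios; the ratio is used here only to keep the example short.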

The tissue obtained from the patient may, but does not have to, comprise cancer cells. 
Just as before, the cancer can, for example, be breast cancer, colon cancer, lung cancer, prostate 
cancer, hepatocellular cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer,
liver cancer, bladder cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma,
melanoma, or brain cancer, breast cancer being particularly preferred.

In a particular embodiment, the RNA is obtained from a microdissected portion of breast
cancer tissue enriched for cancer cells. The control gene set may, for example, comprise β-actin
and ribosomal protein LP0.

The report, prepared for the use of the patient or the patient's physician, may include the
identification of at least one drug potentially beneficial in the treatment of the patient.

Step (b) of the foregoing method may comprise the step of determining the expression 
level of a gene specifically influencing cellular sensitivity to a drug, where the gene can, for 
example, be selected from the group consisting of aldehyde dehydrogenase 1A1, aldehyde
dehydrogenase 1A3, amphiregulin, ARG, BRK, BCRP, CD9, CD31, CD82/KAI-1, COX2, c-abl,
c-kit, c-kit L, CYP1B1, CYP2C9, DHFR, dihydropyrimidine dehydrogenase, EGF, epiregulin,
ER-alpha, ErbB-1, ErbB-2, ErbB-3, ErbB-4, ER-beta, farnesyl pyrophosphate synthetase,
gamma-GCS (glutamyl cysteine synthetase), GATA3, geranyl geranyl pyrophosphate synthetase,
Grb7, GST-alpha, GST-pi, HB-EGF, hsp 27, human chorionic gonadotropin/CGA, IGF-1, IGF-2,
IGF1R, KDR, LIV1, Lung Resistance Protein/MVP, Lot1, MDR-1, microsomal epoxide
hydrolase, MMP9, MRP1, MRP2, MRP3, MRP4, PAI1, PDGF-A, PDGF-B, PDGF-C, PDGF-D,
PDGFR-alpha, PDGFR-beta, PLAG1 (pleiomorphic adenoma 1), PREP (prolyl endopeptidase),
progesterone receptor, pS2/trefoil factor 1, PTEN, PTP1b, RAR-alpha, RAR-beta2, Reduced


WO 03/078662 PCT/US03/07713 

Folate Carrier, SXR, TGF-alpha, thymidine phosphorylase, thymidylate synthase, topoisomerase
II-alpha, topoisomerase II-beta, VEGF, XIST, and YB-1.

In another embodiment, step (b) of the foregoing process includes determining the
expression level of multidrug resistance factors, such as, for example, gamma-glutamyl-cysteine
synthetase (GCS), GST-α, GST-π, MDR-1, MRP 1-4, breast cancer resistance protein (BCRP),
lung cancer resistance protein (MVP), SXR, or YB-1.

In another embodiment, step (b) of the foregoing process comprises determination of the
expression level of eukaryotic translation initiation factor 4E (EIF4E).

In yet another embodiment, step (b) of the foregoing process comprises determination of
the expression level of a DNA repair enzyme.

In a further embodiment, step (b) of the foregoing process comprises determination of the
expression level of a cell cycle regulator, such as, for example, c-MYC, c-Src, Cyclin D1,
Ha-Ras, mdm2, p14ARF, p21WAF1/CIP, p16INK4a/p14, p23, p27, p53, PI3K, PKC-epsilon, or
PKC-delta.

In a still further embodiment, step (b) of the foregoing process comprises determination of
the expression level of a tumor suppressor or a related protein, such as, for example, APC or
E-cadherin.

In another embodiment, step (b) of the foregoing method comprises determination of the
expression level of a gene regulating apoptosis, such as, for example, p53, Bcl2, Bcl-xl, Bak,
Bax, and related factors, NFκB, cIAP1, cIAP2, survivin, and related factors, p53BP1/ASPP1,
or p53BP2/ASPP2.

In yet another embodiment, step (b) of the foregoing process comprises determination of
the expression level of a factor that controls cell invasion or angiogenesis, such as, for example,
uPA, PAI1, cathepsin B, C, and L, scatter factor (HGF), c-met, KDR, VEGF, or CD31.

In a different embodiment, step (b) of the foregoing method comprises determination of
the expression level of a marker for immune or inflammatory cells or processes, such as, for
example, Ig light chain λ, CD18, CD3, CD68, Fas (CD95), or Fas Ligand.

In a further embodiment, step (b) of the foregoing process comprises determination of the 
expression level of a cell proliferation marker, such as, for example, Ki67/MiB1, PCNA, Pin1, or
thymidine kinase.

In a still further embodiment, step (b) of the foregoing process comprises determination of
the expression level of a growth factor or growth factor receptor, such as, for example, IGF1,
IGF2, IGFBP3, IGF1R, FGF2, CSF-1, CSF-1R/fms, SCF-1, IL6 or IL8.

In another embodiment, step (b) of the foregoing process comprises determination of the
expression level of a gene marker that defines a subclass of breast cancer, where the gene marker
can, for example, be GRO1 oncogene alpha, Grb7, cytokeratins 5 and 17, retinol binding protein
4, hepatocyte nuclear factor 3, integrin subunit alpha 7, or lipoprotein lipase.

In a still further aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to 5-fluorouracil (5-FU) or an analog thereof, comprising
the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to 
gene expression analysis; 

(b) determining the expression level in the tissue of thymidylate synthase mRNA,
wherein the expression level is normalized against a control gene or genes, and is compared to
the amount found in a reference breast cancer tissue set; and

(c) predicting patient response based on the normalized thymidylate synthase mRNA
level.

Step (b) of the foregoing method can further comprise determining the expression level of
dihydropyrimidine phosphorylase.

In another embodiment, step (b) of the method can further comprise determining the 
expression level of thymidine phosphorylase. 

In yet another embodiment, a positive response to 5-FU or an analog thereof is predicted
if: (i) the normalized thymidylate synthase mRNA level determined in step (b) is at or below the
15th percentile; or (ii) the sum of normalized expression levels of thymidylate synthase and
dihydropyrimidine phosphorylase determined in step (b) is at or below the 25th percentile; or (iii)
the sum of normalized expression levels of thymidylate synthase, dihydropyrimidine
phosphorylase, plus thymidine phosphorylase determined in step (b) is at or below the 20th
percentile.
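The three 5-FU conditions differ from a single-gene cutoff in that two of them rank a sum of expression levels. A hedged Python sketch follows. It assumes the percentile of a sum is computed against the same sum taken specimen by specimen across the reference set, which the text does not spell out, and it requires all three genes to be measured. Function and variable names are illustrative only.

```python
TS = "thymidylate synthase"
DPD = "dihydropyrimidine phosphorylase"
TP = "thymidine phosphorylase"


def percentile_rank(value, reference_values):
    """Percent of reference values that are <= value (0 to 100)."""
    return 100.0 * sum(v <= value for v in reference_values) / len(reference_values)


def predicts_5fu_response(sample, reference_specimens):
    """Check conditions (i)-(iii) on normalized expression levels.

    sample:              gene name -> normalized level in the patient tissue
    reference_specimens: list of dicts, one per reference tissue, same keys
    """
    def rank_of_sum(genes):
        total = sum(sample[g] for g in genes)
        reference_totals = [sum(spec[g] for g in genes) for spec in reference_specimens]
        return percentile_rank(total, reference_totals)

    return (
        rank_of_sum([TS]) <= 15               # condition (i)
        or rank_of_sum([TS, DPD]) <= 25       # condition (ii)
        or rank_of_sum([TS, DPD, TP]) <= 20   # condition (iii)
    )
```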

In a further embodiment, in step (b) of the foregoing method the expression levels of c-myc
and wild-type p53 are determined. In this case, a positive response to 5-FU or an analog thereof is
predicted, if the normalized expression level of c-myc relative to the normalized expression level
of wild-type p53 is in the upper 15th percentile.

In a still further embodiment, in step (b) of the foregoing method, the expression levels of
NFκB and cIAP2 are determined. In this particular embodiment, resistance to 5-FU or an analog
thereof is typically predicted if the normalized expression levels of NFκB and cIAP2 are at or
above the 10th percentile.
In another aspect, the invention concerns a method for predicting the response of a patient
diagnosed with breast cancer to methotrexate or an analog thereof, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting decreased patient sensitivity to methotrexate or analog if (i) DHFR 
levels are more than tenfold higher than the average expression level of DHFR in the control 
gene set, or (ii) the normalized expression levels of members of the reduced folate carrier (RFC) 
family are below the 10th percentile.

In yet another aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to an anthracycline or an analog thereof, comprising the
steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting patient resistance or decreased sensitivity to the anthracycline or analog
if (i) the normalized expression level of topoisomerase IIα is below the 10th percentile, or (ii) the
normalized expression level of topoisomerase IIβ is below the 10th percentile, or (iii) the
combined normalized topoisomerase IIα or IIβ expression levels are below the 10th percentile.

In a different aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to docetaxel, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting reduced sensitivity to docetaxel if the normalized expression level of
CYP1B1 is in the upper 10th percentile.

The invention further concerns a method for predicting the response of a patient 
diagnosed with breast cancer to cyclophosphamide or an analog thereof, comprising 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting reduced sensitivity to the cyclophosphamide or analog if the sum of the
expression levels of aldehyde dehydrogenase 1A1 and 1A3 is more than tenfold higher than the
average of their combined expression levels in the reference tissue set.
In a further aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to anti-estrogen therapy, comprising

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set that contains
both specimens negative for and positive for estrogen receptor-α (ERα) and progesterone
receptor-α (PRα); and

(b) predicting patient response based upon the normalized expression levels of ERα
or PRα, and at least one of microsomal epoxide hydrolase, pS2/trefoil factor 1, GATA3 and
human chorionic gonadotropin.

In a specific embodiment, lack of response or decreased responsiveness is predicted if (i)
the normalized expression level of microsomal epoxide hydrolase is in the upper 10th percentile;
or (ii) the normalized expression level of pS2/trefoil factor 1, or GATA3 or human chorionic
gonadotropin is at or below the corresponding average expression level in said breast cancer
tissue set, regardless of the expression level of ERα or PRα in the breast cancer tissue obtained
from the patient.

In another aspect, the invention concerns a method for predicting the response of a patient 
diagnosed with breast cancer to a taxane, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting reduced sensitivity to taxane if (i) no or minimal XIST expression is
detected; or (ii) the normalized expression level of GST-π or prolyl endopeptidase (PREP) is in
the upper 10th percentile; or (iii) the normalized expression level of PLAG1 is in the upper 10th
percentile.

The invention also concerns a method for predicting the response of a patient diagnosed
with breast cancer to cisplatin or an analog thereof, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting resistance or reduced sensitivity if the normalized expression level of
ERCC1 is in the upper 10th percentile.

The invention further concerns a method for predicting the response of a patient
diagnosed with breast cancer to an ErbB2 or EGFR antagonist, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting patient response based on the normalized expression levels of at least
one of Grb7, IGF1R, IGF1 and IGF2.

In a particular embodiment, a positive response is predicted if the normalized expression
level of Grb7 is in the upper 10th percentile, and the expression of IGF1R, IGF1 and IGF2 is not
elevated above the 90th percentile.

In a further particular embodiment, a decreased responsiveness is predicted if the
expression level of at least one of IGF1R, IGF1 and IGF2 is elevated.

In another aspect, the invention concerns a method for predicting the response of a patient 
diagnosed with breast cancer to a bis-phosphonate drug, comprising the steps of: 

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting a positive response if the breast cancer tissue obtained from the patient
expresses mutant Ha-Ras and additionally expresses farnesyl pyrophosphate synthetase or
geranyl pyrophosphate synthetase at a normalized expression level at or above the 90th
percentile.

In yet another aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to treatment with a cyclooxygenase 2 inhibitor, comprising
the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting a positive response if the normalized expression level of COX2 in the
breast cancer tissue obtained from the patient is at or above the 90th percentile.

The invention further concerns a method for predicting the response of a patient
diagnosed with breast cancer to an EGF receptor (EGFR) antagonist, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting a positive response to an EGFR antagonist, if (i) the normalized
expression level of EGFR is at or above the 10th percentile, and (ii) the normalized expression
level of at least one of epiregulin, TGF-α, amphiregulin, ErbB3, BRK, CD9, MMP9, CD82, and
Lot1 is above the 90th percentile.

In another aspect, the invention concerns a method for monitoring the response of a
patient diagnosed with breast cancer to treatment with an EGFR antagonist, comprising
monitoring the expression level of a gene selected from the group consisting of epiregulin,
TGF-α, amphiregulin, ErbB3, BRK, CD9, MMP9, CD82, and Lot1 in the patient during treatment,
wherein reduction in the expression level is indicative of positive response to such treatment.

In yet another aspect, the invention concerns a method for predicting the response of a
patient diagnosed with breast cancer to a drug targeting a tyrosine kinase selected from the group
consisting of abl, c-kit, PDGFR-α, PDGFR-β and ARG, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set;

(b) determining the normalized expression level of a tyrosine kinase selected from the
group consisting of abl, c-kit, PDGFR-α, PDGFR-β and ARG, and the cognate ligand of the
tyrosine kinase, and if the normalized expression level of the tyrosine kinase is in the upper 10th
percentile,

(c) determining whether the sequence of the tyrosine kinase contains any mutation,

wherein a positive response is predicted if (i) the normalized expression level of the
tyrosine kinase is in the upper 10th percentile, (ii) the sequence of the tyrosine kinase contains an
activating mutation, or (iii) the normalized expression level of the tyrosine kinase is normal and
the expression level of the ligand is in the upper 10th percentile.
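
The three alternative criteria of the "wherein" clause reduce to a short decision function. This is a sketch only: it assumes percentile ranks against the reference tissue set have already been computed, it reads "upper 10th percentile" as a rank of 90 or higher, and it treats any kinase rank below that cutoff as "normal". The function name and parameters are hypothetical.

```python
def predict_tk_drug_response(kinase_pct, ligand_pct, activating_mutation):
    """Return True if a positive response is predicted.

    kinase_pct, ligand_pct: percentile ranks (0 to 100) of the normalized
    expression of the tyrosine kinase and of its cognate ligand.
    activating_mutation: True if sequencing found an activating mutation.
    """
    upper_decile = 90.0
    if kinase_pct >= upper_decile:   # criterion (i)
        return True
    if activating_mutation:          # criterion (ii)
        return True
    # Criterion (iii): kinase expression is "normal" (assumed here to mean
    # below the upper decile) and the ligand is in the upper decile.
    return ligand_pct >= upper_decile
```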

Another aspect of the invention is a method for predicting the response of a patient
diagnosed with breast cancer to treatment with an anti-angiogenic drug, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) predicting a positive response if (i) the normalized expression level of VEGF is in
the upper 10th percentile and (ii) the normalized expression level of KDR or CD31 is in the upper
20th percentile.

A further aspect of the invention is a method for predicting the likelihood that a patient
diagnosed with breast cancer develops resistance to a drug interacting with the MRP-1 gene
coding for the multidrug resistance protein P-glycoprotein, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis to determine the expression level of PTP1b, wherein the expression
level is normalized against a control gene or genes, and compared to the amount found in a
reference breast cancer tissue set; and

(b) concluding that the patient is likely to develop resistance to said drug if the
normalized expression level of the MRP-1 gene is above the 90th percentile.

The invention further relates to a method for predicting the likelihood that a patient
diagnosed with breast cancer develops resistance to a chemotherapeutic drug or toxin used in
cancer treatment, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) determining the normalized expression levels of at least one of the following
genes: MDR1, GSTα, GSTπ, SXR, BCRP, YB-1, and LRP/MVP, wherein the finding of a
normalized expression level in the upper 4th percentile is an indication that the patient is likely to
develop resistance to the drug.

Also included herein is a method for measuring the translational efficiency of VEGF
mRNA in a breast cancer tissue sample, comprising determining the expression levels of the
VEGF and EIF4E mRNA in the sample, normalized against a control gene or genes, and
compared to the amount found in a reference breast cancer tissue set, wherein a higher
normalized EIF4E expression level for the same VEGF expression level is indicative of relatively
higher translational efficiency for VEGF.

In another aspect, the invention provides a method for predicting the response of a patient
diagnosed with breast cancer to a VEGF antagonist, comprising determining the expression level
of VEGF and EIF4E mRNA normalized against a control gene or genes, and compared to the
amount found in a reference breast cancer tissue set, wherein a VEGF expression level above the
90th percentile and an EIF4E expression level above the 50th percentile is a predictor of good
patient response.

The invention further provides a method for predicting the likelihood of the recurrence of
breast cancer in a patient diagnosed with breast cancer, comprising determining the ratio of
p53:p21 mRNA expression or p53:mdm2 mRNA expression in a breast cancer tissue obtained
from the patient, normalized against a control gene or genes, and compared to the amount found
in a reference breast cancer tissue set, wherein an above normal ratio is indicative of a higher risk
of recurrence. Typically, a higher risk of recurrence is indicated if the ratio is in the upper 10th
percentile.

In yet another aspect, the invention concerns a method for predicting the likelihood of the
recurrence of breast cancer in a breast cancer patient following surgery, comprising determining
the expression level of cyclin D1 in a breast cancer tissue obtained from the patient, normalized
against a control gene or genes, and compared to the amount found in a reference breast cancer
tissue set, wherein an expression level in the upper 10th percentile indicates increased risk of
recurrence following surgery. In a particular embodiment of this method, the patient is subjected
to adjuvant chemotherapy if the expression level is in the upper 10th percentile.

Another aspect of the invention is a method for predicting the likelihood of the recurrence
of breast cancer in a breast cancer patient following surgery, comprising determining the
expression level of APC or E-cadherin in a breast cancer tissue obtained from the patient,
normalized against a control gene or genes, and compared to the amount found in a reference
breast cancer tissue set, wherein an expression level in the upper 5th percentile indicates high risk
of recurrence following surgery, and heightened risk of shortened survival.

A further aspect of the invention is a method for predicting the response of a patient
diagnosed with breast cancer to treatment with a proapoptotic drug, comprising determining the
expression levels of Bcl2 and c-MYC in a breast cancer tissue obtained from the patient,
normalized against a control gene or genes, and compared to the amount found in a reference
breast cancer tissue set, wherein (i) a Bcl2 expression level in the upper 10th percentile in the
absence of elevated expression of c-MYC indicates good response, and (ii) a good response is
not indicated if the expression level of c-MYC is elevated, regardless of the expression level of
Bcl2.

A still further aspect of the invention is a method for predicting treatment outcome for a
patient diagnosed with breast cancer, comprising the steps of:

(a) subjecting RNA extracted from a breast cancer tissue obtained from the patient to
gene expression analysis, wherein gene expression levels are normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set; and

(b) determining the normalized expression levels of NFkB and at least one gene
selected from the group consisting of cIAP1, cIAP2, XIAP, and Survivin,

wherein a poor prognosis is indicated if the expression levels for NFkB and at least one
of the genes selected from the group consisting of cIAP1, cIAP2, XIAP, and Survivin is in the
upper 5th percentile.

The invention further concerns a method for predicting treatment outcome for a patient
diagnosed with breast cancer, comprising determining the expression levels of p53BP1 and
p53BP2 in a breast cancer tissue obtained from the patient, normalized against a control gene or
genes, and compared to the amount found in a reference breast cancer tissue set, wherein a poor
outcome is predicted if the expression level of either p53BP1 or p53BP2 is in the lower 10th
percentile.

The invention additionally concerns a method for predicting treatment outcome for a
patient diagnosed with breast cancer, comprising determining the expression levels of uPA and
PAI1 in a breast cancer tissue obtained from the patient, normalized against a control gene or
genes, and compared to the amount found in a reference breast cancer tissue set, wherein (i) a
poor outcome is predicted if the expression levels of uPA and PAI1 are in the upper 20th
percentile, and (ii) a decreased risk of recurrence is predicted if the expression levels of uPA and
PAI1 are not elevated above the mean observed in the breast cancer reference set. In a particular
embodiment, poor outcome is measured in terms of shortened survival or increased risk of cancer
recurrence following surgery. In another particular embodiment, uPA and PAI1 are expressed at
normal levels, and the patient is subjected to adjuvant chemotherapy following surgery.

Another aspect of the invention is a method for predicting treatment outcome in a patient
diagnosed with breast cancer, comprising determining the expression levels of cathepsin B and
cathepsin L in a breast cancer tissue obtained from the patient, normalized against a control gene
or genes, and compared to the amount found in a reference breast cancer tissue set, wherein a
poor outcome is predicted if the expression level of either cathepsin B or cathepsin L is in the
upper 10th percentile. Just as before, poor treatment outcome may be measured, for example, in
terms of shortened survival or increased risk of cancer recurrence.

A further aspect of the invention is a method for devising the treatment of a patient
diagnosed with breast cancer, comprising the steps of:

(a) determining the expression levels of scatter factor and c-met in a breast cancer
tissue obtained from the patient, normalized against a control gene or genes, and compared to the
amount found in a reference breast cancer tissue set, and

(b) suggesting prompt aggressive chemotherapeutic treatment if the expression levels
of scatter factor and c-met or the combination of both, are above the 90th percentile.

A still further aspect of the invention is a method for predicting treatment outcome for a
patient diagnosed with breast cancer, comprising determining the expression levels of VEGF,
CD31, and KDR in a breast cancer tissue obtained from the patient, normalized against a control
gene or genes, and compared to the amount found in a reference breast cancer tissue set, wherein
a poor treatment outcome is predicted if the expression level of any of VEGF, CD31, and KDR is
in the upper 10th percentile.

Yet another aspect of the invention is a method for predicting treatment outcome for a
patient diagnosed with breast cancer, comprising determining the expression levels of
Ki67/MiB1, PCNA, Pin1, and thymidine kinase in a breast cancer tissue obtained from the
patient, normalized against a control gene or genes, and compared to the amount found in a
reference breast cancer tissue set, wherein a poor treatment outcome is predicted if the expression
level of any of Ki67/MiB1, PCNA, Pin1, and thymidine kinase is in the upper 10th percentile.

The invention further concerns a method for predicting treatment outcome for a patient
diagnosed with breast cancer, comprising determining the expression level of soluble and full
length CD95 in a breast cancer tissue obtained from the patient, normalized against a control
gene or genes, and compared to the amount found in a reference breast cancer tissue set, wherein
the presence of soluble CD95 correlates with poor patient survival.

The invention also concerns a method for predicting treatment outcome for a patient
diagnosed with breast cancer, comprising determining the expression levels of IGF1, IGF1R and
IGFBP3 in a breast cancer tissue obtained from the patient, normalized against a control gene or
genes, and compared to the amount found in a reference breast cancer tissue set, wherein a poor
treatment outcome is predicted if the sum of the expression levels of IGF1, IGF1R and IGFBP3
is in the upper 10th percentile.

The invention additionally concerns a method for classifying breast cancer, comprising
determining the expression level of two or more genes selected from the group consisting of
Bcll2, hepatocyte nuclear factor 3, LIV1, ER, lipoprotein lipase, retinol binding protein 4,
integrin α7, cytokeratin 5, cytokeratin 17, GRO oncogene, ErbB2 and Grb7, in a breast cancer
tissue, normalized against a control gene or genes, and compared to the amount found in a
reference breast cancer tissue set, wherein (i) tumors expressing at least one of Bcl1, hepatocyte
nuclear factor 3, LIV1, and ER above the mean expression level in the reference tissue set are
classified as having a good prognosis for disease free and overall patient survival following
surgical removal; (ii) tumors characterized by elevated expression of at least one of lipoprotein
lipase, retinol binding protein 4, integrin α7 compared to the reference tissue set are classified as
having intermediate prognosis of disease free and overall patient survival following surgical
removal; and (iii) tumors expressing either elevated levels of cytokeratins 5 and 17, and GRO
oncogene at levels four-fold or greater above the mean expression level in the reference tissue set,
or ErbB2 and Grb7 at levels ten-fold or more above the mean expression level in the reference
tissue set are classified as having poor prognosis of disease free and overall patient survival
following surgical removal.
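
The three-way classification can be sketched as a function over fold-changes relative to the reference-set mean. Several points are assumptions made only for illustration: the short gene labels used as dictionary keys, the reading of "elevated" as a fold-change above 1, the reading of clause (iii) as requiring all of cytokeratin 5, cytokeratin 17 and GRO oncogene at four-fold or more, and the order in which the clauses are tested when a tumor satisfies more than one.

```python
GOOD_MARKERS = ("Bcl1", "HNF3", "LIV1", "ER")
INTERMEDIATE_MARKERS = ("LPL", "RBP4", "ITGA7")
CYTOKERATIN_GROUP = ("KRT5", "KRT17", "GRO")
ERBB2_GROUP = ("ErbB2", "Grb7")


def classify_tumor(fold):
    """Classify a tumor from fold-changes over the reference-set mean.

    fold: gene label -> (normalized expression / reference mean).
    Genes missing from `fold` are treated as not expressed.
    """
    cytokeratin_high = all(fold.get(g, 0.0) >= 4.0 for g in CYTOKERATIN_GROUP)
    erbb2_high = all(fold.get(g, 0.0) >= 10.0 for g in ERBB2_GROUP)
    # Assumed precedence: poor, then intermediate, then good.
    if cytokeratin_high or erbb2_high:
        return "poor"
    if any(fold.get(g, 0.0) > 1.0 for g in INTERMEDIATE_MARKERS):
        return "intermediate"
    if any(fold.get(g, 0.0) > 1.0 for g in GOOD_MARKERS):
        return "good"
    return "unclassified"
```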

Another aspect of the invention is a panel of two or more gene specific primers selected 
from the group consisting of the forward and reverse primers listed in Table 2. 

Yet another aspect of the invention is a method for reverse transcription of a fragmented
RNA population in RT-PCR amplification, comprising using a multiplicity of gene specific
primers as the reverse primers in the amplification reaction. In a particular embodiment, the
method uses between two and about 40,000 gene specific primers in the same amplification
reaction. In another embodiment, the gene specific primers are about 18 to 24 bases, such as
about 20 bases in length. In another embodiment, the Tm of the primers is about 58-60 °C. The
primers can, for example, be selected from the group consisting of the forward and reverse
primers listed in Table 2.

The invention also concerns a method of reverse transcriptase driven first strand cDNA
synthesis, comprising using a gene specific primer of about 18 to 24 bases in length and having a
Tm optimum between about 58 °C and about 60 °C. In a particular embodiment, the first strand
cDNA synthesis is followed by PCR DNA amplification, and the primer serves as the reverse
primer that drives the PCR amplification. In another embodiment, the method uses a plurality of
gene specific primers in the same first strand cDNA synthesis reaction mixture. The number of
the gene specific primers can, for example, be between 2 and about 40,000.
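
The length and Tm windows stated above lend themselves to a simple screening check. The text does not say how Tm is calculated, so the sketch below substitutes a common GC-content approximation, Tm = 64.9 + 41 (G + C - 16.4) / N, purely as an assumption; dedicated primer design software typically uses nearest-neighbor thermodynamics and will give different values. Function names are hypothetical.

```python
def approx_tm_celsius(primer):
    """Rough melting temperature from GC content (assumed formula)."""
    seq = primer.upper()
    gc = seq.count("G") + seq.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(seq)


def is_suitable_primer(primer, min_len=18, max_len=24,
                       tm_low=58.0, tm_high=60.0):
    """Check a candidate gene specific primer against the stated windows:
    about 18 to 24 bases long, with Tm of about 58 to 60 degrees C."""
    if not (min_len <= len(primer) <= max_len):
        return False
    return tm_low <= approx_tm_celsius(primer) <= tm_high
```

Under this approximation a 20-base primer needs 14 G or C bases to fall inside the 58 to 60 °C window, so candidates would in practice be screened gene by gene before being pooled in one reaction.
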

In a different aspect, the invention concerns a method of predicting the likelihood of
long-term survival of a breast cancer patient without the recurrence of breast cancer, following
surgical removal of the primary tumor, comprising determining the expression level of one or
more prognostic RNA transcripts or their product in a breast cancer tissue sample obtained from
said patient, normalized against the expression level of all RNA transcripts or their products in
said breast cancer tissue sample, or of a reference set of RNA transcripts or their products,
wherein the prognostic transcript is the transcript of one or more genes selected from the group
consisting of: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, CA9, PR, BBC3,
NME1, SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RPS6KB1, Src, Chk1, ID1, EstR1, p27,
CCNB1, XIAP, Chk2, CDC25B, IGF1R, AK055699, PI3KC2A, TGFB3, BAGI1, CYP3A4,
EpCAM, VEGFC, pS2, hENT1, WISP1, HNF3A, NFKBp65, BRCA2, EGFR, TK1, VDR,
Contig51037, pENT1, EPHX1, IF1A, DIABLO, CDH1, HIF1a, IGFBP3, CTSB, and Her2,
wherein overexpression of one or more of FOXM1, PRAME, STK15, Ki-67, CA9, NME1,
SURV, TFRC, YB-1, RPS6KB1, Src, Chk1, CCNB1, Chk2, CDC25B, CYP3A4, EpCAM,
VEGFC, hENT1, BRCA2, EGFR, TK1, VDR, EPHX1, IF1A, Contig51037, CDH1, HIF1a,
IGFBP3, CTSB, Her2, and pENT1 indicates a decreased likelihood of long-term survival without
breast cancer recurrence, and the overexpression of one or more of Bcl2, CEGP1, GSTM1, PR,
BBC3, GATA3, DPYD, GSTM3, ID1, EstR1, p27, XIAP, IGF1R, AK055699, PI3KC2A,
TGFB3, BAGI1, pS2, WISP1, HNF3A, NFKBp65, and DIABLO indicates an increased
likelihood of long-term survival without breast cancer recurrence.

In a particular embodiment of this method, the expression level of at least 2, preferably at
least 5, more preferably at least 10, most preferably at least 15 prognostic transcripts or their
expression products is determined.

When the breast cancer is invasive breast carcinoma, including both estrogen receptor
(ER) overexpressing (ER positive) and ER negative tumors, the analysis includes determination
of the expression levels of the transcripts of at least two of the following genes, or their
expression products: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, PR, BBC3,
NME1, SURV, GATA3, TFRC, YB-1, DPYD, Src, CA9, Contig51037, RPS6K1 and Her2.

When the breast cancer is ER positive invasive breast carcinoma, the analysis includes
determination of the expression levels of the transcripts of at least two of the following genes, or
their expression products: PRAME, Bcl2, FOXM1, DIABLO, EPHX1, HIF1A, VEGFC, Ki-67,
IGF1R, VDR, NME1, GSTM3, Contig51037, CDC25B, CTSB, p27, CDH1, and IGFBP3.

Just as before, it is preferred to determine the expression levels of at least 5, more
preferably at least 10, most preferably at least 15 genes, or their respective expression products.

In a particular embodiment, the expression level of one or more prognostic RNA
transcripts is determined, where RNA may, for example, be obtained from a fixed,
wax-embedded breast cancer tissue specimen of the patient. The isolation of RNA can, for example,
be carried out following any of the procedures described above or throughout the application, or
by any other method known in the art.

In yet another aspect, the invention concerns an array comprising polynucleotides
hybridizing to the following genes: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1,
PR, BBC3, NME1, SURV, GATA3, TFRC, YB-1, DPYD, CA9, Contig51037, RPS6K1 and
Her2, immobilized on a solid surface.

In a particular embodiment, the array comprises polynucleotides hybridizing to the
following genes: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, CA9, PR, BBC3,
NME1, SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RPS6KB1, Src, Chk1, ID1, EstR1, p27,
CCNB1, XIAP, Chk2, CDC25B, IGF1R, AK055699, PI3KC2A, TGFB3, BAGI1, CYP3A4,
EpCAM, VEGFC, pS2, hENT1, WISP1, HNF3A, NFKBp65, BRCA2, EGFR, TK1, VDR,
Contig51037, pENT1, EPHX1, IF1A, CDH1, HIF1a, IGFBP3, CTSB, Her2 and DIABLO.

In a further aspect, the invention concerns a method of predicting the likelihood of
long-term survival of a patient diagnosed with invasive breast cancer, without the recurrence of breast
cancer, following surgical removal of the primary tumor, comprising the steps of:

(1) determining the expression levels of the RNA transcripts or the expression
products of genes of a gene set selected from the group consisting of

(a) Bcl2, cyclinG1, NFKBp65, NME1, EPHX1, TOP2B, DR5, TERC, Src, DIABLO;
(b) Ki67, XIAP, hENT1, TS, CD9, p27, cyclinG1, pS2, NFKBp65, CYP3A4;
(c) GSTM1, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, NFKBp65, ErbB3;
(d) PR, NME1, XIAP, upa, cyclinG1, Contig51037, TERC, EPHX1, ALDH1A3, CTSL;
(e) CA9, NME1, TERC, cyclinG1, EPHX1, DPYD, Src, TOP2B, NFKBp65, VEGFC;
(f) TFRC, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, ErbB3, NFKBp65;
(g) Bcl2, PRAME, cyclinG1, FOXM1, NFKBp65, TS, XIAP, Ki67, CYP3A4, p27;
(h) FOXM1, cyclinG1, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, p27, NFKBp65;
(i) PRAME, FOXM1, cyclinG1, XIAP, Contig51037, TS, Ki67, PDGFRa, p27, NFKBp65;
(j) Ki67, XIAP, PRAME, hENT1, Contig51037, TS, CD9, p27, ErbB3, cyclinG1;
(k) STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1;
(l) GSTM1, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, ID1, PLAUR;
(m) PR, PRAME, NME1, XIAP, PLAUR, cyclinG1, Contig51037, TERC, EPHX1, DR5;
(n) CA9, FOXM1, cyclinG1, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, p27;
(o) TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NME1, VEGFC; and
(p) CEGP1, PRAME, hENT1, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, ID1, TS

in a breast cancer tissue sample obtained from said patient, normalized against the expression
levels of all RNA transcripts or their products in said breast cancer tissue sample, or of a
reference set of RNA transcripts or their products;

(2) subjecting the data obtained in step (1) to statistical analysis; and

(3) determining whether the likelihood of said long-term survival has increased or
decreased.

In a still further aspect, the invention concerns a method of predicting the likelihood of
long-term survival of a patient diagnosed with estrogen receptor (ER)-positive invasive breast
cancer, without the recurrence of breast cancer, following surgical removal of the primary tumor,
comprising the steps of:

(1) determining the expression levels of the RNA transcripts or the expression
products of genes of a gene set selected from the group consisting of

(a) PRAME, p27, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(b) Contig51037, EPHX1, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(c) Bcl2, hENT1, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(d) HIF1A, PRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(e) IGF1R, PRAME, EPHX1, Contig51037, cyclinG1, Bcl2, NME1, PTEN, TBP, TIMP2;
(f) FOXM1, Contig51037, VEGFC, TBP, HIF1A, DPYD, RAD51C, DCR3, cyclinG1, BAG1;
(g) EPHX1, Contig51037, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(i) CDC25B, Contig51037, hENT1, Bcl2, HLAG, TERC, NME1, upa, ID1, CYP;
(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(k) CTSB, PRAME, p27, IGFBP2, EPHX1, CTSL, BAD, DR5, DCR3, XIAP;
(l) DIABLO, Ki67, hENT1, TIMP2, ID1, p27, KRT19, IGFBP2, TS, PDGFB;
(m) p27, PRAME, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(n) CDH1, PRAME, VEGFC, HIF1A, DPYD, TIMP2, CYP3A4, EstR1, RBP4, p27;
(o) IGFBP3, PRAME, p27, Bcl2, XIAP, EstR1, Ki67, TS, Src, VEGF;
(p) GSTM3, PRAME, p27, IGFBP3, XIAP, FGF2, hENT1, PTEN, EstR1, APC;
(q) hENT1, Bcl2, FOXM1, Contig51037, CyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(r) STK15, VEGFC, PRAME, p27, GCLC, hENT1, ID1, TIMP2, EstR1, MCP1;
(s) NME1, PRAM, p27, IGFBP3, XIAP, PTEN, hENT1, Bcl2, CYP3A4, HLAG;
(t) VDR, Bcl2, p27, hENT1, p53, PI3KC2A, EIF4E, TFRC, MCM3, ID1;
(u) EIF4E, Contig51037, EPHX1, cyclinG1, Bcl2, DR5, TBP, PTEN, NME1, HER2;
(v) CCNB1, PRAME, VEGFC, HIF1A, hENT1, GCLC, TIMP2, ID1, p27, upa;
(w) ID1, PRAME, DIABLO, hENT1, p27, PDGFRa, NME1, BIN1, BRCA1, TP;
(x) FBX05, PRAME, IGFBP3, p27, GSTM3, hENT1, XIAP, FGF2, TS, PTEN;
(y) GUS, HIA1A, VEGFC, GSTM3, DPYD, hENT1, FBX05, CA9, CYP, KRT18; and
(z) Bclx, Bcl2, hENT1, Contig51037, HLAG, CD9, ID1, BRCA1, BIN1, HBEGF;

(2) subjecting the data obtained in step (1) to statistical analysis; and

(3) determining whether the likelihood of said long-term survival has increased or
decreased.

In a different aspect, the invention concerns an array comprising polynucleotides
hybridizing to a gene set selected from the group consisting of:

(a) Bcl2, cyclinG1, NFKBp65, NME1, EPHX1, TOP2B, DR5, TERC, Src, DIABLO;
(b) Ki67, XIAP, hENT1, TS, CD9, p27, cyclinG1, pS2, NFKBp65, CYP3A4;
(c) GSTM1, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, NFKBp65, ErbB3;
(d) PR, NME1, XIAP, upa, cyclinG1, Contig51037, TERC, EPHX1, ALDH1A3, CTSL;
(e) CA9, NME1, TERC, cyclinG1, EPHX1, DPYD, Src, TOP2B, NFKBp65, VEGFC;
(f) TFRC, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, ErbB3, NFKBp65;
(g) Bcl2, PRAME, cyclinG1, FOXM1, NFKBp65, TS, XIAP, Ki67, CYP3A4, p27;
(h) FOXM1, cyclinG1, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, p27, NFKBp65;
(i) PRAME, FOXM1, cyclinG1, XIAP, Contig51037, TS, Ki67, PDGFRa, p27, NFKBp65;
(j) Ki67, XIAP, PRAME, hENT1, Contig51037, TS, CD9, p27, ErbB3, cyclinG1;
(k) STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1;
(l) GSTM1, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, ID1, PLAUR;
(m) PR, PRAME, NME1, XIAP, PLAUR, cyclinG1, Contig51037, TERC, EPHX1, DR5;
(n) CA9, FOXM1, cyclinG1, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, p27;
(o) TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NME1, VEGFC; and
(p) CEGP1, PRAME, hENT1, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, ID1, TS,

immobilized on a solid surface.

In an additional aspect, the invention concerns an array comprising polynucleotides
hybridizing to a gene set selected from the group consisting of:

(a) PRAME, p27, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(b) Contig51037, EPHX1, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(c) Bcl2, hENT1, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(d) HIF1A, PRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(e) IGF1R, PRAME, EPHX1, Contig51037, cyclinG1, Bcl2, NME1, PTEN, TBP, TIMP2;
(f) FOXM1, Contig51037, VEGFC, TBP, HIF1A, DPYD, RAD51C, DCR3, cyclinG1, BAG1;
(g) EPHX1, Contig51037, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(i) CDC25B, Contig51037, hENT1, Bcl2, HLAG, TERC, NME1, upa, ID1, CYP;
(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(k) CTSB, PRAME, p27, IGFBP2, EPHX1, CTSL, BAD, DR5, DCR3, XIAP;
(l) DIABLO, Ki67, hENT1, TIMP2, ID1, p27, KRT19, IGFBP2, TS, PDGFB;
(m) p27, PRAME, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(n) CDH1, PRAME, VEGFC, HIF1A, DPYD, TIMP2, CYP3A4, EstR1, RBP4, p27;
(o) IGFBP3, PRAME, p27, Bcl2, XIAP, EstR1, Ki67, TS, Src, VEGF;
(p) GSTM3, PRAME, p27, IGFBP3, XIAP, FGF2, hENT1, PTEN, EstR1, APC;
(q) hENT1, Bcl2, FOXM1, Contig51037, CyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(r) STK15, VEGFC, PRAME, p27, GCLC, hENT1, ID1, TIMP2, EstR1, MCP1;
(s) NME1, PRAM, p27, IGFBP3, XIAP, PTEN, hENT1, Bcl2, CYP3A4, HLAG;
(t) VDR, Bcl2, p27, hENT1, p53, PI3KC2A, EIF4E, TFRC, MCM3, ID1;
(u) EIF4E, Contig51037, EPHX1, cyclinG1, Bcl2, DR5, TBP, PTEN, NME1, HER2;
(v) CCNB1, PRAME, VEGFC, HIF1A, hENT1, GCLC, TIMP2, ID1, p27, upa;
(w) ID1, PRAME, DIABLO, hENT1, p27, PDGFRa, NME1, BIN1, BRCA1, TP;
(x) FBX05, PRAME, IGFBP3, p27, GSTM3, hENT1, XIAP, FGF2, TS, PTEN;
(y) GUS, HIA1A, VEGFC, GSTM3, DPYD, hENT1, FBX05, CA9, CYP, KRT18; and
(z) Bclx, Bcl2, hENT1, Contig51037, HLAG, CD9, ID1, BRCA1, BIN1, HBEGF,

immobilized on a solid surface.

In all aspects, the polynucleotides can be cDNAs ("cDNA arrays") that are typically about
500 to 5000 bases long, although shorter or longer cDNAs can also be used and are within the
scope of this invention. Alternatively, the polynucleotides can be oligonucleotides (DNA
microarrays), which are typically about 20 to 80 bases long, although shorter and longer
oligonucleotides are also suitable and are within the scope of the invention. The solid surface
can, for example, be glass or nylon, or any other solid surface typically used in preparing arrays,
such as microarrays, and is typically glass.

Brief Description of the Drawings

Figure 1 is a chart illustrating the overall workflow of the process of the invention for
measurement of gene expression. In the Figure, FPET stands for "fixed paraffin-embedded
tissue," and "RT-PCR" stands for "reverse transcriptase PCR." RNA concentration is
determined by using the commercial RiboGreen™ RNA Quantitation Reagent and Protocol.

Figure 2 is a flow chart showing the steps of an RNA extraction method according to the
invention alongside a flow chart of a representative commercial method.

Figure 3 is a scheme illustrating the steps of an improved method for preparing
fragmented mRNA for expression profiling analysis.

Figure 4 illustrates methods for amplification of RNA prior to RT-PCR.

Figure 5 illustrates an alternative scheme for repair and amplification of fragmented
mRNA.

Figure 6 shows the measurement of estrogen receptor mRNA levels in 40 FPE breast
cancer specimens via RT-PCR. Three 10 micron sections were used for each measurement.
Each data point represents the average of triplicate measurements.

Figure 7 shows the results of the measurement of progesterone receptor mRNA levels in
40 FPE breast cancer specimens via RT-PCR performed as described in the legend of Figure 6
above.

Figure 8 shows results from an IVT/RT-PCR experiment.

Figure 9 is a representation of the expression of 92 genes across 70 FPE breast cancer
specimens. The y-axis shows expression as cycle threshold times. These genes are a subset of
the genes listed in Table 1.

Table 1 shows a breast cancer gene list.

Table 2 sets forth amplicon and primer sequences used for amplification of fragmented
mRNA.

Table 3 shows the Accession Nos. and SEQ ID NOS of the breast cancer genes examined.

Detailed Description of the Preferred Embodiment

A. Definitions

Unless defined otherwise, technical and scientific terms used herein have the same
meaning as commonly understood by one of ordinary skill in the art to which this invention
belongs. Singleton et al., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley &
Sons (New York, NY 1994), and March, Advanced Organic Chemistry Reactions, Mechanisms
and Structure 4th ed., John Wiley & Sons (New York, NY 1992), provide one skilled in the art
with a general guide to many of the terms used in the present application.

One skilled in the art will recognize many methods and materials similar or equivalent to 
those described herein, which could be used in the practice of the present invention. Indeed, the 
present invention is in no way limited to the methods and materials described. For purposes of
the present invention, the following terms are defined below. 

The term "microarray" refers to an ordered arrangement of hybridizable array elements,
preferably polynucleotide probes, on a substrate. 

The term "polynucleotide," when used in singular or plural, generally refers to any
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA or
modified RNA or DNA. Thus, for instance, polynucleotides as defined herein include, without
limitation, single- and double-stranded DNA, DNA including single- and double-stranded 
regions, single- and double-stranded RNA, and RNA including single- and double-stranded 
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or include single- and double-stranded regions. In addition, the term
"polynucleotide" as used herein refers to triple-stranded regions comprising RNA or DNA or 
both RNA and DNA. The strands in such regions may be from the same molecule or from 
different molecules. The regions may include all of one or more of the molecules, but more 
typically involve only a region of some of the molecules. One of the molecules of a triple-helical 
region often is an oligonucleotide. The term "polynucleotide" specifically includes DNAs and
RNAs that contain one or more modified bases. Thus, DNAs or RNAs with backbones modified 
for stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, 
DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritiated 
bases, are included within the term "polynucleotides" as defined herein. In general, the term 
"polynucleotide" embraces all chemically, enzymatically and/or metabolically modified forms of
unmodified polynucleotides, as well as the chemical forms of DNA and RNA characteristic of 
viruses and cells, including simple and complex cells. 

The term "oligonucleotide" refers to a relatively short polynucleotide, including, without 
limitation, single-stranded deoxyribonucleotides, single- or double-stranded ribonucleotides, 
RNA:DNA hybrids and double-stranded DNAs. Oligonucleotides, such as single-stranded DNA
probe oligonucleotides, are often synthesized by chemical methods, for example using automated 
oligonucleotide synthesizers that are commercially available. However, oligonucleotides can be 
made by a variety of other methods, including in vitro recombinant DNA-mediated techniques
and by expression of DNAs in cells and organisms.

The terms "differentially expressed gene," "differential gene expression" and their
synonyms, which are used interchangeably, refer to a gene whose expression is activated to a 
higher or lower level in a subject suffering from a disease, specifically cancer, such as breast 
cancer, relative to its expression in a normal or control subject. The terms also include genes 
whose expression is activated to a higher or lower level at different stages of the same disease. It
is also understood that a differentially expressed gene may be either activated or inhibited at the 
nucleic acid level or protein level, or may be subject to alternative splicing to result in a different 
polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface 
expression, secretion or other partitioning of a polypeptide, for example. Differential gene 
expression may include a comparison of expression between two or more genes, or a comparison
of the ratios of the expression between two or more genes, or even a comparison of two 
differently processed products of the same gene, which differ between normal subjects and 
subjects suffering from a disease, specifically cancer, or between various stages of the same
disease. Differential expression includes both quantitative, as well as qualitative, differences in 
the temporal or cellular expression pattern in a gene or its expression products among, for
example, normal and diseased cells, or among cells which have undergone different disease 
events or disease stages. For the purpose of this invention, "differential gene expression" is 
considered to be present when there is at least an about two-fold, preferably at least about
four-fold, more preferably at least about six-fold, most preferably at least about ten-fold difference
between the expression of a given gene in normal and diseased subjects, or in various stages of
disease development in a diseased subject. 
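
The fold-difference thresholds in this definition can be illustrated with a short sketch. The function names, the tier labels, and the use of linear-scale expression values are assumptions made for illustration only; the specification does not prescribe a particular computation.

```python
def fold_difference(normal, diseased):
    """Return the fold difference (always >= 1) between two positive expression values."""
    if normal <= 0 or diseased <= 0:
        raise ValueError("expression values must be positive")
    ratio = diseased / normal
    # Treat a four-fold decrease the same as a four-fold increase.
    return ratio if ratio >= 1 else 1 / ratio


def differential_tier(normal, diseased):
    """Map a fold difference onto the thresholds named in the definition.

    Returns None when the difference is below two-fold.
    """
    fold = fold_difference(normal, diseased)
    # Hypothetical labels; checked from the most to the least stringent threshold.
    for threshold, label in ((10, "ten-fold"), (6, "six-fold"),
                             (4, "four-fold"), (2, "two-fold")):
        if fold >= threshold:
            return label
    return None
```

Under these assumptions, a gene measured at 8.0 in normal tissue and 1.0 in diseased tissue falls in the "six-fold" tier, since the direction of the change is ignored.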

The phrase "gene amplification" refers to a process by which multiple copies of a gene or 
gene fragment are formed in a particular cell or cell line. The duplicated region (a stretch of 
amplified DNA) is often referred to as "amplicon." Usually, the amount of the messenger RNA
(mRNA) produced, i.e., the level of gene expression, also increases in proportion to the
number of copies made of the particular gene expressed.

The term "prognosis" is used herein to refer to the prediction of the likelihood of cancer- 
attributable death or progression, including recurrence, metastatic spread, and drug resistance, of 
a neoplastic disease, such as breast cancer. The term "prediction" is used herein to refer to the 
likelihood that a patient will respond either favorably or unfavorably to a drug or set of drugs,
and also the extent of those responses. The predictive methods of the present invention can be 
used clinically to make treatment decisions by choosing the most appropriate treatment 
modalities for any particular patient. The predictive methods of the present invention are
valuable tools in predicting if a patient is likely to respond favorably to a treatment regimen, such
as surgical intervention, chemotherapy with a given drug or drug combination, and/or radiation
therapy. 

The term "increased resistance" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol.

The term "decreased sensitivity" to a particular drug or treatment option, when used in 
accordance with the present invention, means decreased response to a standard dose of the drug 
or to a standard treatment protocol, where decreased response can be compensated for (at least 
partially) by increasing the dose of drug, or the intensity of treatment.

"Patient response" can be assessed using any endpoint indicating a benefit to the patient,
including, without limitation, (1) inhibition, to some extent, of tumor growth, including slowing
down and complete growth arrest; (2) reduction in the number of tumor cells; (3) reduction in 
tumor size; (4) inhibition (i.e., reduction, slowing down or complete stopping) of tumor cell 
infiltration into adjacent peripheral organs and/or tissues; (5) inhibition (i.e., reduction, slowing
down or complete stopping) of metastasis; (6) enhancement of anti-tumor immune response,
which may, but does not have to, result in the regression or rejection of the tumor; (7) relief, to 
some extent, of one or more symptoms associated with the tumor; (8) increase in the length of 
survival following treatment; and/or (9) decreased mortality at a given point of time following 
treatment. 

The term "treatment" refers to both therapeutic treatment and prophylactic or preventative
measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic
condition or disorder. Those in need of treatment include those already with the disorder as well 
as those prone to have the disorder or those in whom the disorder is to be prevented. In tumor 
(e.g., cancer) treatment, a therapeutic agent may directly decrease the pathology of tumor cells, or
render the tumor cells more susceptible to treatment by other therapeutic agents, e.g., radiation
and/or chemotherapy. 

The term "tumor," as used herein, refers to all neoplastic cell growth and proliferation, 
whether malignant or benign, and all pre-cancerous and cancerous cells and tissues. 

The terms "cancer" and "cancerous" refer to or describe the physiological condition in 
mammals that is typically characterized by unregulated cell growth. Examples of cancer include,
but are not limited to, breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular
cancer, gastric cancer, pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder 
cancer, cancer of the urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain 
cancer.

The "pathology" of cancer includes all phenomena that compromise the well-being of the
patient. This includes, without limitation, abnormal or uncontrollable cell growth, metastasis,
interference with the normal functioning of neighboring cells, release of cytokines or other
secretory products at abnormal levels, suppression or aggravation of inflammatory or
immunological response, neoplasia, premalignancy, malignancy, invasion of surrounding or
distant tissues or organs, such as lymph nodes, etc.

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in
the art, and generally is an empirical calculation dependent upon probe length, washing
temperature, and salt concentration. In general, longer probes require higher temperatures for
proper annealing, while shorter probes need lower temperatures. Hybridization generally
depends on the ability of denatured DNA to reanneal when complementary strands are present in
an environment below their melting temperature. The higher the degree of desired homology
between the probe and hybridizable sequence, the higher the relative temperature which can be
used. As a result, it follows that higher relative temperatures would tend to make the reaction
conditions more stringent, while lower temperatures less so. For additional details and
explanation of stringency of hybridization reactions, see Ausubel et al., Current Protocols in
Molecular Biology, Wiley Interscience Publishers, (1995).

"Stringent conditions" or "high stringency conditions", as defined herein, typically: (1)
employ low ionic strength and high temperature for washing, for example 0.015 M sodium
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during
hybridization a denaturing agent, such as formamide, for example, 50% (v/v) formamide with
0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate
buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ
50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH
6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50
μg/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash
consisting of 0.1 x SSC containing EDTA at 55°C.

"Moderately stringent conditions" may be identified as described by Sambrook et al.,
Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and
include the use of washing solution and hybridization conditions (e.g., temperature, ionic
strength and %SDS) less stringent than those described above. An example of moderately
stringent conditions is overnight incubation at 37°C in a solution comprising: 20% formamide, 5
x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x
Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA,
followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize
how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as
probe length and the like.

In the context of the present invention, reference to "at least one," "at least two," "at least
five," etc. of the genes listed in any particular gene set means any one or any and all
combinations of the genes listed. 

The terms "splicing" and "RNA splicing" are used interchangeably and refer to RNA 
processing that removes introns and joins exons to produce mature mRNA with continuous 
coding sequence that moves into the cytoplasm of a eukaryotic cell.

In theory, the term "exon" refers to any segment of an interrupted gene that is represented 
in the mature RNA product (B. Lewin, Genes IV, Cell Press, Cambridge, Mass., 1990). In theory,
the term "intron" refers to any segment of DNA that is transcribed but removed from within the 
transcript by splicing together the exons on either side of it. Operationally, exon sequences occur 
in the mRNA sequence of a gene as defined by Ref. Seq ID numbers. Operationally, intron
sequences are the intervening sequences within the genomic DNA of a gene, bracketed by exon 
sequences and having GT and AG splice consensus sequences at their 5' and 3' boundaries. 
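
The operational definition of an intron can be sketched in a few lines. This is an illustrative example only: the function names and the half-open (start, end) coordinate convention are assumptions, and the sequence in the usage note is invented rather than taken from any gene of the invention.

```python
def has_consensus_splice_sites(genomic_dna, intron_start, intron_end):
    """Check that genomic_dna[intron_start:intron_end] begins with GT and ends with AG.

    Coordinates are 0-based and half-open, an assumption made for this sketch.
    """
    intron = genomic_dna[intron_start:intron_end].upper()
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")


def spliced_sequence(genomic_dna, introns):
    """Remove the given (start, end) intron intervals and join the flanking exons."""
    pieces = []
    position = 0
    for start, end in sorted(introns):
        pieces.append(genomic_dna[position:start])  # exon preceding this intron
        position = end
    pieces.append(genomic_dna[position:])  # final exon
    return "".join(pieces)
```

For example, with the invented sequence "ATGGCC" + "GTAAGTTTTCAG" + "GATTGA", the interval (6, 18) satisfies the GT...AG check, and removing it joins the two flanking exon segments into one continuous sequence.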
B. Detailed Description 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology (including recombinant techniques), microbiology,
cell biology, and biochemistry, which are within the skill of the art. Such techniques are
explained fully in the literature, such as, "Molecular Cloning: A Laboratory Manual", 2nd edition
(Sambrook et al., 1989); "Oligonucleotide Synthesis" (M.J. Gait, ed., 1984); "Animal Cell
Culture" (R.I. Freshney, ed., 1987); "Methods in Enzymology" (Academic Press, Inc.);
"Handbook of Experimental Immunology", 4th edition (D.M. Weir & C.C. Blackwell, eds.,
Blackwell Science Inc., 1987); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller &
M.P. Calos, eds., 1987); "Current Protocols in Molecular Biology" (F.M. Ausubel et al., eds.,
1987); and "PCR: The Polymerase Chain Reaction", (Mullis et al., eds., 1994). 
1. Gene Expression Profiling 
In general, methods of gene expression profiling can be divided into two large groups:
methods based on hybridization analysis of polynucleotides, and methods based on sequencing of
polynucleotides. The most commonly used methods known in the art for the quantification of 
mRNA expression in a sample include northern blotting and in situ hybridization (Parker & 
Barnes, Methods in Molecular Biology 106:247-283 (1999)); RNAse protection assays (Hod,
Biotechniques 13:852-854 (1992)); and reverse transcription polymerase chain reaction
(RT-PCR) (Weis et al., Trends in Genetics 8:263-264 (1992)). Alternatively, antibodies may be
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and 
DNA-RNA hybrid duplexes or DNA-protein duplexes. Representative methods for
sequencing-based gene expression analysis include Serial Analysis of Gene Expression (SAGE), and gene
expression analysis by massively parallel signature sequencing (MPSS). 
2. Reverse Transcriptase PCR (RT-PCR)

Of the techniques listed above, the most sensitive and most flexible quantitative method is
RT-PCR, which can be used to compare mRNA levels in different sample populations, in normal
and tumor tissues, with or without drug treatment, to characterize patterns of gene expression, to
discriminate between closely related mRNAs, and to analyze RNA structure. 

The first step is the isolation of mRNA from a target sample. The starting material is 
typically total RNA isolated from human tumors or tumor cell lines, and corresponding normal
tissues or cell lines, respectively. Thus RNA can be isolated from a variety of primary tumors,
including breast, lung, colon, prostate, brain, liver, kidney, pancreas, spleen, thymus, testis,
ovary, uterus, etc., tumor, or tumor cell lines, with pooled DNA from healthy donors. If the
source of mRNA is a primary tumor, mRNA can be extracted, for example, from frozen or
archived paraffin-embedded and fixed (e.g. formalin-fixed) tissue samples. 

General methods for mRNA extraction are well known in the art and are disclosed in 
standard textbooks of molecular biology, including Ausubel et al., Current Protocols of
Molecular Biology, John Wiley and Sons (1997). Methods for RNA extraction from paraffin
embedded tissues are disclosed, for example, in Rupp and Locker, Lab Invest. 56:A67 (1987),
and De Andres et al., BioTechniques 18:42-44 (1995). In particular, RNA isolation can be
performed using a purification kit, buffer set and protease from commercial manufacturers, such as
Qiagen, according to the manufacturer's instructions. For example, total RNA from cells in
culture can be isolated using Qiagen RNeasy mini-columns. Other commercially available RNA 
isolation kits include MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®, 
Madison, WI), and Paraffin Block RNA Isolation Kit (Ambion, Inc.). Total RNA from tissue 
samples can be isolated using RNA Stat-60 (Tel-Test). RNA prepared from tumor can be 
isolated, for example, by cesium chloride density gradient centrifugation.

As RNA cannot serve as a template for PCR, the first step in gene expression profiling by 
RT-PCR is the reverse transcription of the RNA template into cDNA, followed by its exponential 
amplification in a PCR reaction. The two most commonly used reverse transcriptases are avian
myeloblastosis virus reverse transcriptase (AMV-RT) and Moloney murine leukemia virus
reverse transcriptase (MMLV-RT). The reverse transcription step is typically primed using
specific primers, random hexamers, or oligo-dT primers, depending on the circumstances and the 
goal of expression profiling. For example, extracted RNA can be reverse-transcribed using a 
GeneAmp RNA PCR kit (Perkin Elmer, CA, USA), following the manufacturer's instructions.
The derived cDNA can then be used as a template in the subsequent PCR reaction.

Although the PCR step can use a variety of thermostable DNA-dependent DNA 
polymerases, it typically employs the Taq DNA polymerase, which has a 5'-3' nuclease activity 
but lacks a 3'-5' proofreading endonuclease activity. Thus, TaqMan® PCR typically utilizes the
5'-nuclease activity of Taq or Tth polymerase to hydrolyze a hybridization probe bound to its
target amplicon, but any enzyme with equivalent 5' nuclease activity can be used. Two
oligonucleotide primers are used to generate an amplicon typical of a PCR reaction. A third
oligonucleotide, or probe, is designed to detect nucleotide sequence located between the two
PCR primers. The probe is non-extendible by Taq DNA polymerase enzyme, and is labeled with
a reporter fluorescent dye and a quencher fluorescent dye. Any laser-induced emission from the
reporter dye is quenched by the quenching dye when the two dyes are located close together as
they are on the probe. During the amplification reaction, the Taq DNA polymerase enzyme
cleaves the probe in a template-dependent manner. The resultant probe fragments disassociate in
solution, and signal from the released reporter dye is free from the quenching effect of the second
fluorophore. One molecule of reporter dye is liberated for each new molecule synthesized, and
detection of the unquenched reporter dye provides the basis for quantitative interpretation of the
data. 

TaqMan® RT-PCR can be performed using commercially available equipment, such as, 
for example, ABI PRISM 7700™ Sequence Detection System™ (Perkin-Elmer-Applied 
Biosystems, Foster City, CA, USA), or Lightcycler (Roche Molecular Biochemicals, Mannheim,
Germany). In a preferred embodiment, the 5' nuclease procedure is run on a real-time
quantitative PCR device such as the ABI PRISM 7700™ Sequence Detection System™. The 
system consists of a thermocycler, laser, charge-coupled device (CCD), camera and computer. 
The system amplifies samples in a 96-well format on a thermocycler. During amplification, 
laser-induced fluorescent signal is collected in real-time through fiber optics cables for all 96
wells, and detected at the CCD. The system includes software for running the instrument and for
analyzing the data. 

5'-Nuclease assay data are initially expressed as Ct, or the threshold cycle. As discussed 
above, fluorescence values are recorded during every cycle and represent the amount of product
amplified to that point in the amplification reaction. The point when the fluorescent signal is
first recorded as statistically significant is the threshold cycle (Ct).
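
One way to picture how a threshold cycle might be called from a fluorescence trace is sketched below. The specification does not state the statistical criterion; the baseline window (cycles 3 to 15) and the threshold of ten standard deviations above the baseline mean are assumptions chosen for illustration, as is the function name.

```python
from statistics import mean, stdev


def threshold_cycle(fluorescence, baseline_cycles=(3, 15), n_sd=10.0):
    """Return the first cycle (1-based) whose signal exceeds the threshold.

    fluorescence: one reading per PCR cycle, in cycle order.
    The threshold is the baseline mean plus n_sd baseline standard
    deviations (an assumed criterion, not taken from the specification).
    Returns None if the signal never crosses the threshold.
    """
    start, end = baseline_cycles
    baseline = fluorescence[start - 1:end]  # readings for cycles start..end
    threshold = mean(baseline) + n_sd * stdev(baseline)
    for cycle in range(end + 1, len(fluorescence) + 1):
        if fluorescence[cycle - 1] > threshold:
            return cycle
    return None
```

Instrument software typically reports a fractional Ct by interpolating between cycles; this sketch returns a whole cycle number to keep the logic visible.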

To minimize errors and the effect of sample-to-sample variation, RT-PCR is usually 
performed using an internal standard. The ideal internal standard is expressed at a constant level 
among different tissues, and is unaffected by the experimental treatment. RNAs most frequently
used to normalize patterns of gene expression are mRNAs for the housekeeping genes
glyceraldehyde-3-phosphate-dehydrogenase (GAPDH) and β-actin.
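
Normalization against an internal standard can be sketched as a simple Ct difference. The averaging of several reference genes and the assumption that product doubles every cycle (so that relative expression is 2 raised to the negative Ct difference) are illustrative simplifications, not requirements stated in the specification; the function name is hypothetical.

```python
def normalized_expression(ct_target, ct_references):
    """Normalize a target gene Ct against one or more reference gene Cts.

    Returns (delta_ct, relative_expression), where relative_expression
    assumes perfect doubling of product in every PCR cycle.
    """
    if not ct_references:
        raise ValueError("at least one reference Ct is required")
    reference = sum(ct_references) / len(ct_references)  # mean reference Ct
    delta_ct = ct_target - reference
    return delta_ct, 2.0 ** (-delta_ct)
```

With a target Ct of 27.0 and reference Cts of 24.0 and 26.0 (for example GAPDH and β-actin), the mean reference Ct is 25.0, the Ct difference is 2.0, and the target is estimated at one quarter of the reference level.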

A more recent variation of the RT-PCR technique is the real time quantitative PCR, 
which measures PCR product accumulation through a dual-labeled fluorigenic probe (i.e., 
TaqMan® probe). Real time PCR is compatible both with quantitative competitive PCR, where
an internal competitor for each target sequence is used for normalization, and with quantitative
comparative PCR using a normalization gene contained within the sample, or a housekeeping
gene for RT-PCR. For further details see, e.g. Held et al., Genome Research 6:986-994 (1996).
3. Microarrays

Differential gene expression can also be identified, or confirmed, using the microarray
technique. Thus, the expression profile of breast cancer-associated genes can be measured in
either fresh or paraffin-embedded tumor tissue, using microarray technology. In this method, 
polynucleotide sequences of interest are plated, or arrayed, on a microchip substrate. The arrayed 
sequences are then hybridized with specific DNA probes from cells or tissues of interest. Just as
in the RT-PCR method, the source of mRNA typically is total RNA isolated from human tumors
or tumor cell lines, and corresponding normal tissues or cell lines. Thus RNA can be isolated 
from a variety of primary tumors or tumor cell lines. If the source of mRNA is a primary tumor, 
mRNA can be extracted, for example, from frozen or archived paraffin-embedded and fixed (e.g. 
formalin-fixed) tissue samples, which are routinely prepared and preserved in everyday clinical
practice.

In a specific embodiment of the microarray technique, PCR amplified inserts of cDNA 
clones are applied to a substrate in a dense array. Preferably at least 10,000 nucleotide sequences 
are applied to the substrate. The microarrayed genes, immobilized on the microchip at 10,000 
elements each, are suitable for hybridization under stringent conditions. Fluorescently labeled 
cDNA probes may be generated through incorporation of fluorescent nucleotides by reverse
transcription of RNA extracted from tissues of interest. Labeled cDNA probes applied to the 
chip hybridize with specificity to each spot of DNA on the array. After stringent washing to
remove non-specifically bound probes, the chip is scanned by confocal laser microscopy or by 
another detection method, such as a CCD camera. Quantitation of hybridization of each arrayed
element allows for assessment of corresponding mRNA abundance. With dual color
fluorescence, separately labeled cDNA probes generated from two sources of RNA are 
hybridized pairwise to the array. The relative abundance of the transcripts from the two sources 
corresponding to each specified gene is thus determined simultaneously. The miniaturized scale
of the hybridization affords a convenient and rapid evaluation of the expression pattern for large
numbers of genes. Such methods have been shown to have the sensitivity required to detect rare 
transcripts, which are expressed at a few copies per cell, and to reproducibly detect at least 
approximately two-fold differences in the expression levels (Schena et al., Proc. Natl. Acad. Sci.
USA 93(2):106-149 (1996)). Microarray analysis can be performed by commercially available
equipment, following manufacturer's protocols, such as by using the Affymetrix GeneChip
technology, or Incyte's microarray technology.
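
The dual color comparison described above is commonly summarized as a log ratio per array element. The following is a minimal sketch under stated assumptions: the function names, the optional background subtraction, and the gene labels in the usage note are hypothetical, and the two-fold cutoff simply mirrors the detection limit cited in the text.

```python
import math


def log2_ratio(signal_a, signal_b, background_a=0.0, background_b=0.0):
    """Return log2 of the background-corrected ratio of two channel signals.

    Returns None when either corrected signal is not positive.
    """
    a = signal_a - background_a
    b = signal_b - background_b
    if a <= 0 or b <= 0:
        return None
    return math.log2(a / b)


def call_differences(spots, min_fold=2.0):
    """Keep array elements whose two channels differ by at least min_fold.

    spots maps an element name to a (signal_a, signal_b) pair.
    """
    cutoff = math.log2(min_fold)
    calls = {}
    for gene, (signal_a, signal_b) in spots.items():
        ratio = log2_ratio(signal_a, signal_b)
        if ratio is not None and abs(ratio) >= cutoff:
            calls[gene] = ratio
    return calls
```

For instance, an element reading 800 in one channel and 200 in the other gives a log2 ratio of 2.0 (four-fold higher in the first source), whereas readings of 300 and 280 fall below the two-fold cutoff and are not called.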

The development of microarray methods for large-scale analysis of gene expression
makes it possible to search systematically for molecular markers of cancer classification and 
outcome prediction in a variety of tumor types. 
4. Serial Analysis of Gene Expression (SAGE)

Serial analysis of gene expression (SAGE) is a method that allows the simultaneous and 
quantitative analysis of a large number of gene transcripts, without the need of providing an 
individual hybridization probe for each transcript. First, a short sequence tag (about 10-14 bp) is
generated that contains sufficient information to uniquely identify a transcript, provided that the
tag is obtained from a unique position within each transcript. Then, many transcripts are linked
together to form long serial molecules, that can be sequenced, revealing the identity of the 
multiple tags simultaneously. The expression pattern of any population of transcripts can be 
quantitatively evaluated by determining the abundance of individual tags, and identifying the 
gene corresponding to each tag. For more details see, e.g. Velculescu et al., Science 270:484-487
(1995); and Velculescu et al., Cell 88:243-51 (1997).
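
The tag counting step of SAGE can be illustrated with a simplified sketch. It assumes that every tag in the sequenced serial molecule directly follows a four-base anchoring site (CATG is used here as an example) and has a fixed length of 10 bases; ditag handling and tag orientation are omitted, and the function names and the tag-to-gene lookup table are hypothetical.

```python
from collections import Counter


def extract_tags(concatemer, anchor="CATG", tag_length=10):
    """Return the fixed-length tags that follow each anchor site, in order."""
    tags = []
    position = concatemer.find(anchor)
    while position != -1:
        tag_start = position + len(anchor)
        tag = concatemer[tag_start:tag_start + tag_length]
        if len(tag) == tag_length:  # ignore a truncated tag at the end
            tags.append(tag)
        # Resume the search after this tag so anchor-like bases inside it are skipped.
        position = concatemer.find(anchor, tag_start + tag_length)
    return tags


def tag_abundance(concatemers, tag_to_gene):
    """Count tags across sequenced molecules and report counts per gene."""
    counts = Counter()
    for concatemer in concatemers:
        for tag in extract_tags(concatemer):
            counts[tag_to_gene.get(tag, "unassigned")] += 1
    return counts
```

The resulting counts per gene give the quantitative expression pattern described above, with tags absent from the lookup table collected under "unassigned".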

5. Gene Expression Analysis by Massively Parallel Signature Sequencing (MPSS)
This method, described by Brenner et al., Nature Biotechnology 18:630-634 (2000), is a
sequencing approach that combines non-gel-based signature sequencing with in vitro cloning of
millions of templates on separate 5 μm diameter microbeads. First, a microbead library of DNA
templates is constructed by in vitro cloning. This is followed by the assembly of a planar array of
the template-containing microbeads in a flow cell at a high density (typically greater than 3 x 10^6
microbeads/cm^2). The free ends of the cloned templates on each microbead are analyzed
simultaneously, using a fluorescence-based signature sequencing method that does not require 
DNA fragment separation. This method has been shown to simultaneously and accurately
provide, in a single operation, hundreds of thousands of gene signature sequences from a yeast
cDNA library. 

6. General Description of the mRNA Isolation, Purification and Amplification
Methods of the Invention 

The steps of a representative protocol of the invention, including mRNA isolation,
purification, primer extension and amplification are illustrated in Figure 1. As shown in Figure
1, this representative process starts with cutting about 10 μm thick sections of paraffin-embedded
tumor tissue samples. The RNA is then extracted, and protein and DNA are removed, following
the method of the invention described below. After analysis of the RNA concentration, RNA
repair and/or amplification steps may be included, if necessary, and RNA is reverse transcribed
using gene specific promoters followed by RT-PCR. Finally, the data are analyzed to identify the
best treatment option(s) available to the patient on the basis of the characteristic gene expression
pattern identified in the tumor sample examined. The individual steps of this protocol will be 
discussed in greater detail below. 
7. Improved Method for Isolation of Nucleic Acid from Archived Tissue Specimens

As discussed above, in the first step of the method of the invention, total RNA is extracted 
from the source material of interest, including fixed, paraffin-embedded tissue specimens, and 
purified sufficiently to act as a substrate in an enzyme assay. Despite the availability of 
commercial products, and the extensive knowledge available concerning the isolation of nucleic 
acid, such as RNA, from tissues, isolation of nucleic acid (RNA) from fixed, paraffin-embedded
tissue specimens (FPET) is not without difficulty. 

In one aspect, the present invention concerns an improved method for the isolation of 
nucleic acid from archived, e.g. FPET tissue specimens. Measured levels of mRNA species are 
useful for defining the physiological or pathological status of cells and tissues. RT-PCR (which 
is discussed above) is one of the most sensitive, reproducible and quantitative methods for this
"gene expression profiling". Paraffin-embedded, formalin-fixed tissue is the most widely 
available material for such studies. Several laboratories have demonstrated that it is possible to 
successfully use fixed-paraffin-embedded tissue (FPET) as a source of RNA for RT-PCR (Stanta 
et al., Biotechniques 11:304-308 (1991); Stanta et al., Methods Mol. Biol. 86:23-26 (1998);
Jackson et al., Lancet 1:1391 (1989); Jackson et al., J. Clin. Pathol. 43:499-504 (1999); Finke et
al., Biotechniques 14:448-453 (1993); Goldsworthy et al., Mol. Carcinog. 25:86-91 (1999);
Stanta and Bonin, Biotechniques 24:271-276 (1998); Godfrey et al., J. Mol. Diagnostics 2:84
(2000); Specht et al., J. Mol. Med. 78:B27 (2000); Specht et al., Am. J. Pathol. 158:419-429
(2001)). This allows gene expression profiling to be carried out on the most commonly available
source of human biopsy specimens, and therefore potentially to create new valuable diagnostic
and therapeutic information. 

The most widely used protocols utilize hazardous organic solvents, such as xylene, or 
octane (Finke et al., supra) to dewax the tissue in the paraffin blocks before nucleic acid (RNA
and/or DNA) extraction. Obligatory organic solvent removal (e.g. with ethanol) and rehydration
steps follow, which necessitate multiple manipulations, and addition of substantial total time to
the protocol, which can take up to several days. Commercial kits and protocols for RNA
extraction from FPET [MasterPure™ Complete DNA and RNA Purification Kit (EPICENTRE®,
Madison, WI); Paraffin Block RNA Isolation Kit (Ambion, Inc.) and RNeasy™ Mini kit
(Qiagen, Chatsworth, CA)] use xylene for deparaffinization, in procedures which typically
require multiple centrifugations and ethanol buffer changes, and incubations following
incubation with xylene.

The present invention provides an improved nucleic acid extraction protocol that produces 
nucleic acid, in particular RNA, sufficiently intact for gene expression measurements. The key
step in the nucleic acid extraction protocol herein is the performance of dewaxing without the use
of any organic solvent, thereby eliminating the need for multiple manipulations associated with
the removal of the organic solvent, and substantially reducing the total time of the protocol.
According to the invention, wax, e.g. paraffin, is removed from wax-embedded tissue samples by
incubation at 65-75 °C in a lysis buffer that solubilizes the tissue and hydrolyzes the protein,
followed by cooling to solidify the wax.

Figure 2 shows a flow chart of an RNA extraction protocol of the present invention in 
comparison with a representative commercial method, using xylene to remove wax. The times 
required for individual steps in the processes and for the overall processes are shown in the chart. 
As shown, the commercial process requires approximately 50% more time than the process of the
invention.

The lysis buffer can be any buffer known for cell lysis. It is, however, preferred that 
oligo-dT-based methods of selectively purifying polyadenylated mRNA not be used to isolate 
RNA for the present invention, since the bulk of the mRNA molecules are expected to be 
fragmented and therefore will not have an intact polyadenylated tail, and will not be recovered or 
available for subsequent analytical assays. Otherwise, any number of standard nucleic acid
purification schemes can be used. These include chaotrope and organic solvent extractions,
extraction using glass beads or filters, salting out and precipitation based methods, or any of the
purification methods known in the art to recover total RNA or total nucleic acids from a
biological source.

Lysis buffers are commercially available, such as, for example, from
Ambion. A preferred group of lysis buffers typically contains urea, and Proteinase K or other
protease. Proteinase K is very useful in the isolation of high quality, undamaged DNA or RNA,
since most mammalian DNases and RNases are rapidly inactivated by this enzyme, especially in
the presence of 0.5 - 1% sodium dodecyl sulfate (SDS). This is particularly important in the
case of RNA, which is more susceptible to degradation than DNA. While DNases require metal
ions for activity, and can therefore be easily inactivated by chelating agents, such as EDTA, there
is no similar co-factor requirement for RNases.

Cooling and resultant solidification of the wax permits easy separation of the wax from
the total nucleic acid, which can be conveniently precipitated, e.g. by isopropanol. Further
processing depends on the intended purpose. If the proposed method of RNA analysis is subject
to bias by contaminating DNA in an extract, the RNA extract can be further treated, e.g. by
DNase, post purification to specifically remove DNA while preserving RNA. For example, if the
goal is to isolate high quality RNA for subsequent RT-PCR amplification, nucleic acid
precipitation is followed by the removal of DNA, usually by DNase treatment. However, DNA
can be removed at various stages of nucleic acid isolation, by DNase or other techniques well
known in the art.

While the advantages of the nucleic acid extraction protocol of the invention are most
apparent for the isolation of RNA from archived, paraffin embedded tissue samples, the wax
removal step of the present invention, which does not involve the use of an organic solvent, can
also be included in any conventional protocol for the extraction of total nucleic acid (RNA and
DNA) or DNA only. All of these aspects are specifically within the scope of the invention.

By using heat followed by cooling to remove paraffin, the process of the present invention
saves valuable processing time, and eliminates a series of manipulations, thereby potentially
increasing the yield of nucleic acid. Indeed, experimental evidence presented in the examples
below demonstrates that the method of the present invention does not compromise RNA yield.

8. 5'-Multiplexed Gene Specific Priming of Reverse Transcription
RT-PCR requires reverse transcription of the test RNA population as a first step. The
most commonly used primer for reverse transcription is oligo-dT, which works well when RNA
is intact. However, this primer will not be effective when RNA is highly fragmented, as is the
case in FPE tissues.

The present invention includes the use of gene specific primers, which are roughly 20 
bases in length with a Tm optimum between about 58 °C and 60 °C. These primers will also 
serve as the reverse primers that drive PCR DNA amplification.

Another aspect of the invention is the inclusion of multiple gene-specific primers in the
same reaction mixture. The number of such different primers can vary greatly and can be as low
as two and as high as 40,000 or more. Table 2 displays examples of reverse primers that can be
successfully used in carrying out the methods of the invention. Figure 9 shows expression data
obtained using this multiplexed gene-specific priming strategy. Specifically, Figure 9 is a
representation of the expression of 92 genes (a subset of genes listed in Table 1) across 70 FPE
breast cancer specimens. The y-axis shows expression as cycle threshold times.
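
The primer criteria given above (roughly 20 bases, Tm between about 58 °C and 60 °C) can be
illustrated with a minimal Python sketch. The Tm formula used here is a common rough GC-content
approximation and is an assumption made for illustration only; it is not the calculation used by
the primer design software named in this specification, and the length window and function names
are likewise hypothetical.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def approx_tm(primer: str) -> float:
    """Rough Tm estimate in deg C from GC content (assumed approximation):
    Tm = 64.9 + 41 * (GC_count - 16.4) / length."""
    p = primer.upper()
    gc = p.count("G") + p.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(p)


def is_candidate(primer: str, min_len: int = 18, max_len: int = 22,
                 tm_low: float = 58.0, tm_high: float = 60.0) -> bool:
    """Check a primer against the length and Tm windows (windows are assumptions)."""
    return (min_len <= len(primer) <= max_len
            and tm_low <= approx_tm(primer) <= tm_high)


# A reverse primer is the reverse complement of the 3' end of the target region.
target_3prime_end = "ATATATGCGCGCGCGCGCGC"  # hypothetical sequence
reverse_primer = reverse_complement(target_3prime_end)
print(reverse_primer, round(approx_tm(reverse_primer), 2), is_candidate(reverse_primer))
```

In practice a nearest-neighbor Tm model would be expected to give more reliable values than this
approximation; the sketch only shows how a length and Tm window can be applied to candidates.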

An alternative approach is based on the use of random hexamers as primers for cDNA
synthesis. However, we have experimentally demonstrated that the method of using a
multiplicity of gene-specific primers is superior to the known approach using random
hexamers.

9. Preparation of Fragmented mRNA for Expression Profiling Assays
It is of interest to analyze the abundance of specific mRNA species in biological samples,
since this expression profile provides an index of the physiological state of that sample. mRNA
is notoriously difficult to extract and maintain in its native state; consequently, mRNA recovered
from biological sources is often fragmented or somewhat degraded. This is especially true of
human tissue specimens that have been chemically fixed and stored for extended periods of
time.

In one aspect, the present invention provides a means of preparing the mRNA extracted
from various sources, including archived tissue specimens, for expression profiling in a way that
its relative abundance is preserved and the mRNAs of interest can be successfully measured.
This method is useful as a means of preparing mRNA for analysis by any of the known
expression profiling methods, including RT-PCR coupled with 5' exonuclease of reporter probes
(TaqMan® type assays), as discussed above, flap endonuclease assays (Cleavase® and Invader®
type assays), oligonucleotide hybridization arrays, cDNA hybridization arrays, oligonucleotide
ligation assays, 3' single nucleotide extension assays and other assays designed to assess the
abundance of specific mRNA sequences in a biological sample.

According to the method of the invention, total RNA is extracted from the source material 
and sufficiently purified to act as a substrate in an enzyme assay. The extraction procedure,
including a new and improved way of removing the wax (e.g. paraffin) used for embedding the
tissue samples, has been discussed above. It has also been noted that it is preferred that oligo-dT
based methods of selectively purifying polyadenylated mRNA not be used to isolate RNA for this
invention since the bulk of the mRNA is expected to be fragmented, will not be polyadenylated
and, therefore, will not be recovered and available for subsequent analytical assays if an oligo-dT
based method is used.

A diagram of an improved method for repairing fragmented RNA is shown in Figure 3.
The fragmented RNA purified from the tissue sample is mixed with universal or gene-specific,
single-stranded, DNA templates for each mRNA species of interest. These templates may be full
length DNA copies of the mRNA derived from cloned gene sources, they may be fragments of
the gene representing only the segment of the gene to be assayed, or they may be a series of long
oligonucleotides representing either the full length gene or the specific segment(s) of interest.
The template can represent either a single consensus sequence or be a mixture of polymorphic
variants of the gene. This DNA template, or scaffold, will preferably include one or more dUTP
or rNTP sites in its length. This will provide a means of removing the template prior to carrying
out subsequent analytical steps to avoid its acting as a substrate or target in later analysis assays.
This removal is accomplished by treating the sample with uracil-DNA glycosylase (UDG) and
heating it to cause strand breaks where UDG has generated abasic sites. In the case of rNTPs,
the sample can be heated in the presence of a basic buffer (pH ~10) to induce strand breaks
where rNTPs are located in the template.

The single stranded DNA template is mixed with the purified RNA, the mixture is 
denatured and annealed so that the RNA fragments complementary to the DNA template 
effectively become primers that can be extended along the single stranded DNA templates. DNA
polymerase I requires a primer for extension but will efficiently use either a DNA or an RNA
primer. Therefore in the presence of DNA polymerase I and dNTPs, the fragmented RNA can
be extended along the complementary DNA templates. In order to increase the efficiency of the
extension, this reaction can be thermally cycled, allowing overlapping templates and extension
products to hybridize and extend until the overall population of fragmented RNA becomes
represented as double stranded DNA extended from RNA fragment primers.

Following the generation of this "repaired" RNA, the sample should be treated with UDG
or heat-treated in a mildly basic solution to fragment the DNA template (scaffold) and prevent it
from participating in subsequent analytical reactions.

The product resulting from this enzyme extension can then be used as a template in a
standard enzyme profiling assay that includes amplification and detectable signal generation such
as fluorescent, chemiluminescent, colorimetric or other common read outs from enzyme based
assays. For example, for TaqMan® type assays, this double stranded DNA product is added as
the template in a standard assay; and, for array hybridization, this product acts as the cDNA
template for the cRNA labeling reaction typically used to generate single-stranded, labeled RNA
for array hybridization.

This method of preparing template has the advantage of recovering information from
mRNA fragments too short to effectively act as templates in standard cDNA generation schemes.
In addition, this method acts to preserve the specific locations in mRNA sequences targeted by
specific analysis assays. For example, TaqMan® assays rely on a single contiguous sequence in
a cDNA copy of mRNA to act as a PCR amplification template targeted by a labeled reporter
probe. If mRNA strand breaks occur in this sequence, the assay will not detect that template and
will underestimate the quantity of that mRNA in the original sample. This target preparation
method minimizes that effect from RNA fragmentation.

The amount of extension product formed in the RNA primer extension assay can be controlled by
controlling the input quantity of the single stranded DNA template and by doing limited cycling
of the extension reaction. This is important in preserving the relative abundance of the mRNA
sequences targeted for analysis.

This method has the added advantage of not requiring parallel preparation for each target
sequence since it is easily multiplexed. It is also possible to use large pools of random sequence
long oligonucleotides or full libraries of cloned sequences to extend the entire population of
mRNA sequences in the sample extract for whole expressed genome analysis rather than targeted
gene specific analysis.

10. Amplification of mRNA Species Prior to RT-PCR
Due to the limited amount and poor quality of mRNA that can be isolated from FPET, a
new procedure that could accurately amplify mRNAs of interest would be very useful,
particularly for real time quantitation of gene expression (TaqMan®) and especially for
large numbers of genes (more than 50, up to 10,000).

Current protocols (e.g. Eberwine, Biotechniques 20:584-91 (1996)) are optimized for
mRNA amplification from small amounts of total or poly A+ RNA, mainly for microarray analysis.
The present invention provides a protocol optimized for amplification of small amounts of
fragmented total RNA (average size about 60-150 bps), utilizing gene-specific sequences as
primers, as illustrated in Figure 4.

The amplification procedure of the invention uses a very large number, typically as many
as 100 - 190,000 gene specific primers (GSPs) in one reverse transcription run. Each GSP
contains an RNA polymerase promoter, e.g. a T7 DNA-dependent RNA polymerase promoter, at
the 5' end for subsequent RNA amplification. GSPs are preferred as primers because of the
small size of the RNA. Current protocols utilize dT primers, which would not adequately
represent all reverse transcripts of mRNAs due to the small size of the FPET RNA. GSPs can be
designed by optimizing usual parameters, such as length, Tm, etc. For example, GSPs can be
designed using the Primer Express® (Applied Biosystems), or Primer 3 (MIT) software program.
Typically at least 3 sets per gene are designed, and the ones giving the lowest Ct on FPET RNA
(best performers) are selected.
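
The selection step described above (design at least three primer sets per gene, keep the one with
the lowest Ct on FPET RNA) can be sketched as follows. This is only an illustration: the gene
names, set labels, and Ct values are made up, and the function name is hypothetical.

```python
def select_best_primer_sets(ct_by_gene):
    """For each gene, return the label of the primer set with the lowest Ct.

    ct_by_gene maps gene name -> {primer set label: Ct measured on FPET RNA}.
    A lower Ct means the amplicon was detected in fewer cycles.
    """
    best = {}
    for gene, ct_by_set in ct_by_gene.items():
        best[gene] = min(ct_by_set, key=ct_by_set.get)
    return best


# Hypothetical Ct measurements for three candidate sets per gene.
measured = {
    "GENE_A": {"set1": 31.2, "set2": 28.7, "set3": 30.1},
    "GENE_B": {"set1": 27.5, "set2": 29.9, "set3": 33.0},
}
print(select_best_primer_sets(measured))
```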

Second strand cDNA synthesis is performed by standard procedures (see Figure 4,
Method 1), or by GSPf primers and Taq pol under PCR conditions (e.g., 95 °C, 10 min (Taq
activation) then 60 °C, 45 sec). The advantages of the latter method are that the second gene
specific primer, GSPf, adds additional specificity (and potentially more efficient second strand
synthesis) and the option of performing several cycles of PCR, if more starting DNA is necessary
for RNA amplification by T7 RNA polymerase. RNA amplification is then performed under
standard conditions to generate multiple copies of cRNA, which is then used in a standard
TaqMan® reaction.

Although this process is illustrated by using T7-based RNA amplification, a person
skilled in the art will understand that other RNA polymerase promoters that do not require a
primer, such as T3 or Sp6, can also be used, and are within the scope of the invention.

11. A Method of Elongation of Fragmented RNA and Subsequent Amplification
This method, which combines and modifies the inventions described in sections 9 and 10
above, is illustrated in Figure 5. The procedure begins with elongation of fragmented mRNA.
This occurs as described above except that the scaffold DNAs are tagged with the T7 RNA
polymerase promoter sequence at their 5' ends, leading to double-stranded DNA extended from
RNA fragments. The template sequences need to be removed after in vitro transcription. These
templates can include dUTP or rNTP nucleotides, enabling enzymatic removal of the templates
as described in section 9, or the templates can be removed by DNase I treatment.

The template DNA can be a population representing different mRNAs of any number. A
high sequence complexity source of DNA templates (scaffolds) can be generated by pooling
RNA from a variety of cells or tissues. In one embodiment, these RNAs are converted into
double stranded DNA and cloned into phagemids. Single stranded DNA can then be rescued by
phagemid growth and single stranded DNA isolation from purified phagemids.

This invention is useful because it increases gene expression profile signals in two different
ways: both by increasing test mRNA polynucleotide sequence length and by in vitro transcription
amplification. An additional advantage is that it eliminates the need to carry out reverse
transcription optimization with gene specific primers tagged with the T7 RNA polymerase
promoter sequence, and thus is comparatively fast and economical.

This invention can be used with a variety of different methods to profile gene expression,
e.g., RT-PCR or a variety of DNA array methods. Just as in the previous protocol, this approach
is illustrated by using a T7 promoter but the invention is not so limited. A person skilled in the
art will appreciate that other RNA polymerase promoters, such as T3 or Sp6, can also
be used.

12. Breast Cancer Gene Set, Assayed Gene Subsequences, and Clinical Application of
Gene Expression Data

An important aspect of the present invention is to use the measured expression of certain
genes by breast cancer tissue to match patients to best drugs or drug combinations, and to provide
prognostic information. For this purpose it is necessary to correct for (normalize away) both
differences in the amount of RNA assayed and variability in the quality of the RNA used.
Therefore, the assay measures and incorporates the expression of certain normalizing genes,
including well known housekeeping genes, such as GAPDH and Cyp1. Alternatively,
normalization can be based on the mean or median signal (Ct) of all of the assayed genes or a
large subset thereof (global normalization approach). On a gene-by-gene basis, the measured
normalized amount of a patient tumor mRNA is compared to the amount found in a breast cancer
tissue reference set. The number (N) of breast cancer tissues in this reference set should be
sufficiently high to ensure that different reference sets (as a whole) behave essentially the same
way. If this condition is met, the identity of the individual breast cancer tissues present in a
particular set will have no significant impact on the relative amounts of the genes assayed.
Usually, the breast cancer tissue reference set consists of at least about 30, preferably at least
about 40 different FPE breast cancer tissue specimens. Unless noted otherwise, normalized
expression levels for each mRNA/tested tumor/patient will be expressed as a percentage of the
expression level measured in the reference set. More specifically, the reference set of a
sufficiently high number (e.g. 40) of tumors yields a distribution of normalized levels of each
mRNA species. The level measured in a particular tumor sample to be analyzed falls at some
percentile within this range, which can be determined by methods well known in the art. Below,
unless noted otherwise, reference to expression levels of a gene assumes normalized expression
relative to the reference set although this is not always explicitly stated.
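
The normalization and percentile steps described above can be sketched in Python as follows. The
sketch assumes that normalization is done by subtracting each gene's Ct from the mean Ct of the
normalizing genes (so that higher values mean higher expression, in log2 units) and that the
percentile is the share of reference-set values at or below the tumor value. Both choices, and all
names and numbers, are illustrative assumptions rather than the procedure of this specification.

```python
from statistics import mean


def normalize_ct(sample_ct, reference_genes):
    """Normalized expression per gene: mean Ct of the normalizing genes minus gene Ct.

    Ct falls as expression rises, so the subtraction yields a value that
    increases with expression (log2 scale).
    """
    ref = mean(sample_ct[g] for g in reference_genes)
    return {g: ref - ct for g, ct in sample_ct.items() if g not in reference_genes}


def percentile_in_reference(value, reference_values):
    """Percentage of reference-set values that are less than or equal to value."""
    n_le = sum(1 for v in reference_values if v <= value)
    return 100.0 * n_le / len(reference_values)


# Hypothetical tumor sample; GAPDH and CYP1 act as normalizing genes.
tumor_ct = {"GAPDH": 20.0, "CYP1": 22.0, "TS": 25.0}
tumor_norm = normalize_ct(tumor_ct, reference_genes=("GAPDH", "CYP1"))

# Hypothetical normalized TS values from a (much too small) reference set.
reference_ts = [-6.0, -5.0, -4.0, -3.0]
print(tumor_norm["TS"], percentile_in_reference(tumor_norm["TS"], reference_ts))
```

A real reference set would contain at least about 30 to 40 specimens, as stated above; four values
are used here only to keep the example short.
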

The breast cancer gene set is shown in Table 1. The gene Accession Numbers, and the
SEQ ID NOs for the forward primer, reverse primer and amplicon sequences that can be used for
gene amplification, are listed in Table 2. The basis for inclusion of markers, as well as the
clinical significance of mRNA level variations with respect to the reference set, is indicated
below. Genes are grouped into subsets based on the type of clinical significance indicated by
their expression levels: A. Prediction of patient response to drugs used in breast cancer treatment,
or to drugs that are approved for other indications and could be used off-label in the treatment of
breast cancer. B. Prognostic for survival or recurrence of cancer.

C. Prediction of Patient Response to Therapeutic Drugs
1. Molecules that specifically influence cellular sensitivity to drugs

Table 1 lists 74 genes (shown in italics) that specifically influence cellular sensitivity to
potent drugs, which are also listed. Most of the drugs shown are approved and already used to
treat breast cancer (e.g., anthracyclines; cyclophosphamide; methotrexate; 5-FU and analogues).
Several of the drugs are used to treat breast cancer off-label or are in clinical development phase
(e.g., bisphosphonates and anti-VEGF mAb). Several of the drugs have not been widely used to
treat breast cancer but are used in other cancers in which the indicated target is expressed (e.g.,
Celebrex is used to treat familial colon cancer; cisplatin is used to treat ovarian and other
cancers).

Patient response to 5FU is indicated if normalized thymidylate synthase mRNA amount is
at or below the 15th percentile, or the sum of expression of thymidylate synthase plus
dihydropyrimidine phosphorylase is at or below the 25th percentile, or the sum of expression of
these mRNAs plus thymidine phosphorylase is at or below the 20th percentile. Patients with
dihydropyrimidine dehydrogenase below the 5th percentile are at risk of adverse response to 5FU, or
analogs such as Xeloda.
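
The percentile thresholds in the preceding paragraph can be written as a small rule check. The
sketch below assumes that the percentiles (relative to the reference set) have already been
computed for each single gene and for each summed signal; the function and argument names are
hypothetical, and the code is an illustration of the stated cut-offs, not a clinical tool.

```python
def assess_5fu(ts_pct, ts_plus_dp_pct, ts_dp_tp_pct, dpd_pct):
    """Apply the 5FU percentile rules stated in the text.

    ts_pct          percentile of thymidylate synthase
    ts_plus_dp_pct  percentile of the summed thymidylate synthase and
                    dihydropyrimidine phosphorylase signal
    ts_dp_tp_pct    percentile of that sum plus thymidine phosphorylase
    dpd_pct         percentile of dihydropyrimidine dehydrogenase
    """
    response_indicated = (
        ts_pct <= 15 or ts_plus_dp_pct <= 25 or ts_dp_tp_pct <= 20
    )
    adverse_risk = dpd_pct < 5
    return {"response_indicated": response_indicated, "adverse_risk": adverse_risk}


# Hypothetical patient: low thymidylate synthase, normal dehydrogenase level.
print(assess_5fu(ts_pct=12, ts_plus_dp_pct=40, ts_dp_tp_pct=35, dpd_pct=50))
```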

When levels of thymidylate synthase and dihydropyrimidine dehydrogenase are within
the acceptable range as defined in the preceding paragraph, amplification of c-myc mRNA in the
upper 15%, against a background of wild-type p53 [as defined below], predicts a beneficial
response to 5FU (see D. Arango et al., Cancer Res. 61:4910-4915 (2001)). In the presence of
normal levels of thymidylate synthase and dihydropyrimidine dehydrogenase, levels of NFkB
and cIAP2 in the upper 10% indicate resistance of breast tumors to the chemotherapeutic drug
5FU.

Patient resistance to anthracyclines is indicated if the normalized mRNA level of
topoisomerase IIα is below the 10th percentile, or if the topoisomerase IIβ normalized mRNA
level is below the 10th percentile, or if the combined normalized topoisomerase IIα and IIβ signals
are below the 10th percentile.

Patient sensitivity to methotrexate is compromised if DHFR levels are more than tenfold
higher than the average reference set level for this mRNA species, or if reduced folate carrier
levels are below the 10th percentile.

Patients whose tumors express CYP1B1 in the upper 10% have reduced likelihood of
responding to docetaxel.

The sum of signals for aldehyde dehydrogenase 1A1 and 1A3, when more than tenfold
higher than the reference set average, indicates reduced likelihood of response to
cyclophosphamide.

Currently, estrogen and progesterone receptor expression as measured by
immunohistochemistry is used to select patients for anti-estrogen therapy. We have
demonstrated RT-PCR assays for estrogen and progesterone receptor mRNA levels that predict
levels of these proteins as determined by standard clinical diagnostic tests, with a high degree of
concordance (Figures 6 and 7).

Patients whose tumors express ERα or PR mRNA in the upper 70% are likely to respond
to tamoxifen or other anti-estrogens (thus, operationally, lower levels of ERα than this are
defined as ERα-negative). However, when the signal for microsomal epoxide hydrolase is in the
upper 10% or when mRNAs for pS2/trefoil factor, GATA3 or human chorionic gonadotropin are
at or below average levels found in ERα-negative tumors, anti-estrogen therapy will not be
beneficial.

Absence of XIST signal compromises the likelihood of response to taxanes, as does
elevation of the GST-π or prolyl endopeptidase [PREP] signal in the upper 10%. Elevation of
PLAG1 in the upper 10% decreases sensitivity to taxanes.

Expression of ERCC1 mRNA in the upper 10% indicates significant risk of resistance to
cisplatin or analogs.

An RT-PCR assay of Her2 mRNA expression predicts Her2 overexpression as measured
by a standard diagnostic test, with a high degree of concordance (data not shown). Patients whose
tumors express Her2 (normalized to cyp1) in the upper 10% have increased likelihood of
beneficial response to treatment with Herceptin or other ErbB2 antagonists. Measurement of
expression of Grb7 mRNA serves as a test for HER2 gene amplification, because the Grb7 gene
is closely linked to Her2. When Her2 expression is high as defined above in this paragraph,
similarly elevated Grb7 indicates Her2 gene amplification. Overexpression of IGF1R and/or
IGF1 or IGF2 decreases likelihood of beneficial response to Herceptin and also to EGFR
antagonists.

Patients whose tumors express mutant Ha-Ras, and also express farnesyl pyrophosphate
synthetase or geranyl pyrophosphonate synthetase mRNAs at levels above the tenth percentile,
comprise a group that is especially likely to exhibit a beneficial response to bis-phosphonate
drugs.

Cox2 is a key control enzyme in the synthesis of prostaglandins. It is frequently expressed
at elevated levels in subsets of various types of carcinomas including carcinoma of the breast.
Expression of this gene is controlled at the transcriptional level, so RT-PCR serves as a valid
indicator of the cellular enzyme activity. Nonclinical research has shown that cox2 promotes
tumor angiogenesis, suggesting that this enzyme is a promising drug target in solid tumors.
Several Cox2 antagonists are marketed products for use in anti-inflammatory conditions.
Treatment of familial adenomatous polyposis patients with the cox2 inhibitor Celebrex
significantly decreased the number and size of neoplastic polyps. No cox2 inhibitor has yet been
approved for treatment of breast cancer, but generally this class of drugs is safe and could be
prescribed off-label in breast cancers in which cox2 is over-expressed. Tumors expressing
COX2 at levels in the upper ten percentile have increased chance of beneficial response to
Celebrex or other cyclooxygenase 2 inhibitors.

The tyrosine kinases ErbB1 [EGFR], ErbB3 [Her3] and ErbB4 [Her4]; also the ligands
TGFalpha, amphiregulin, heparin-binding EGF-like growth factor, and epiregulin; also BRK, a
non-receptor kinase. Several drugs in clinical development block the EGF receptor. ErbB2-4, the
indicated ligands, and BRK also increase the activity of the EGFR pathway. Breast cancer
patients whose tumors express high levels of EGFR or EGFR and abnormally high levels of the
other indicated activators of the EGFR pathway are potential candidates for treatment with an
EGFR antagonist.

Patients whose tumors express less than 10% of the average level of EGFR mRNA
observed in the reference panel are relatively less likely to respond to EGFR antagonists [such as
Iressa, or ImClone 225]. In cases in which the EGFR is above this low range, the additional
presence of epiregulin, TGFα, amphiregulin, or ErbB3, or BRK, CD9, MMP9, or Lot1 at levels
above the 90th percentile predisposes to response to EGFR antagonists. Epiregulin gene
expression, in particular, is a good surrogate marker for EGFR activation, and can be used
not only to predict response to EGFR antagonists, but also to monitor response to EGFR antagonists
[taking fine needle biopsies to provide tumor tissue during treatment]. Levels of CD82 above the
90th percentile suggest poorer efficacy from EGFR antagonists.
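
The EGFR-related criteria above combine a level relative to the reference average with percentile
thresholds for other markers. A minimal sketch of how these flags might be evaluated is given
below. It reports each finding separately and does not rank them, since the text does not state
how conflicting findings should be weighed; the names, the dictionary layout, and the example
values are assumptions.

```python
# Markers whose level above the 90th percentile predisposes to response (from the text).
ACTIVATORS = ("epiregulin", "TGFalpha", "amphiregulin", "ErbB3",
              "BRK", "CD9", "MMP9", "Lot1")


def egfr_antagonist_flags(egfr_fraction_of_ref_mean, marker_percentiles):
    """Return the findings relevant to EGFR antagonist response.

    egfr_fraction_of_ref_mean  tumor EGFR level divided by the reference-panel average
    marker_percentiles         mapping of marker name -> percentile in the reference set
    """
    egfr_low = egfr_fraction_of_ref_mean < 0.10
    # Activator markers are only considered when EGFR is above the low range.
    high_activators = [] if egfr_low else [
        m for m in ACTIVATORS if marker_percentiles.get(m, 0.0) > 90
    ]
    cd82_high = marker_percentiles.get("CD82", 0.0) > 90
    return {"egfr_low": egfr_low,
            "high_activators": high_activators,
            "cd82_high": cd82_high}


# Hypothetical tumor with EGFR at 80% of the reference average.
print(egfr_antagonist_flags(0.8, {"epiregulin": 95.0, "CD9": 40.0, "CD82": 20.0}))
```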

The tyrosine kinases abl, c-kit, PDGFRalpha, PDGFbeta, and ARG; also, the signal
transmitting ligands c-kit ligand, PDGFA, B, C and D. The listed tyrosine kinases are all targets
of the drug Gleevec™ (imatinib mesylate, Novartis), and the listed ligands stimulate one or more
of the listed tyrosine kinases. In the two indications for which Gleevec™ is approved, tyrosine
kinase targets (bcr-abl and ckit) are overexpressed and also contain activating mutations. A
finding that one of the Gleevec™ target tyrosine kinases is expressed in breast cancer tissue
will prompt a second stage of analysis wherein the gene will be sequenced to determine whether
it is mutated. That a mutation found is an activating mutation can be proved by methods known
in the art, such as, for example, by measuring kinase enzyme activity or by measuring
phosphorylation status of the particular kinase, relative to the corresponding wild-type kinase.
Breast cancer patients whose tumors express high levels of mRNAs encoding Gleevec™ target
tyrosine kinases, specifically, in the upper ten percentile, or mRNAs for Gleevec™ target tyrosine
kinases in the average range and mRNAs for their cognate growth stimulating ligands in the
upper ten percentile, are particularly good candidates for treatment with Gleevec™.

VEGF is a potent and pathologically important angiogenic factor. (See below under
Prognostic Indicators.) When VEGF mRNA levels are in the upper ten percentile, aggressive
treatment is warranted. Such levels particularly suggest the value of treatment with
anti-angiogenic drugs, including VEGF antagonists, such as anti-VEGF antibodies. Additionally,
KDR or CD31 mRNA level in the upper 20 percentile further increases likelihood of benefit from
VEGF antagonists.

Farnesyl pyrophosphate synthetase and geranyl geranyl pyrophosphate synthetase.
These enzymes are targets of commercialized bisphosphonate drugs, which were developed
originally for treatment of osteoporosis but recently have begun to be prescribed off-label in
breast cancer. Elevated levels of mRNAs encoding these enzymes in breast cancer tissue, above
the 90th percentile, suggest use of bisphosphonates as a treatment option.

2. Multidrug Resistance Factors

These factors include 10 genes: gamma glutamyl cysteine synthetase [GCS]; GST-α;
GST-π; MDR-1; MRP1-4; breast cancer resistance protein [BCRP]; lung resistance protein
[MVP]; SXR; YB-1.

GCS and both GST-α and GST-π regulate glutathione levels, which decrease cellular
sensitivity to chemotherapeutic drugs and other toxins by reductive derivatization. Glutathione is
a necessary cofactor for multi-drug resistant pumps, MDR-1 and the MRPs. MDR1 and MRPs
function to actively transport out of cells several important chemotherapeutic drugs used in breast
cancer.

GSTs, MDR-1, and MRP-1 have all been studied extensively to determine whether they have
prognostic or predictive significance in human cancer. However, a great deal of disagreement
exists in the literature with respect to these questions. Recently, new members of the MRP
family have been identified: MRP-2, MRP-3, MRP-4, BCRP, and lung resistance protein [major
vault protein]. These have substrate specificities that overlap with those of MDR-1 and MRP-1.
The incorporation of all of these relevant ABC family members as well as glutathione synthetic
enzymes into the present invention captures the contribution of this family to drug resistance, in a
way that single or double analyte assays cannot.

MRP-1, the gene coding for the multidrug resistance protein,
p-glycoprotein, is not regulated primarily at the transcriptional level. However,
p-glycoprotein stimulates the transcription of PTP1b. An embodiment of the present invention is
the use of the level of the mRNA for the phosphatase PTP1b as a surrogate measure of
MRP-1/p-glycoprotein activity.

The gene SXR is also an activator of multidrug resistance, as it stimulates transcription of
certain multidrug resistance factors.

The impact of multidrug resistance factors with respect to chemotherapeutic agents used
in breast cancer is as follows. Beneficial response to doxorubicin is compromised when the
mRNA levels of either MDR1, GSTα, GSTπ, SXR, BCRP, YB-1, or LRP/MVP are in the upper
four percentile. Beneficial response to methotrexate is inhibited if mRNA levels of any of
MRP1, MRP2, MRP3, or MRP4 or gamma-glutamyl cysteine synthetase are in the upper four
percentile.

3. Eukaryotic Translation Initiation Factor 4E [EIF4E]

EIF4E mRNA levels provide evidence of protein expression and so expand the
capability of RT-PCR to indicate variation in gene expression. Thus, one claim of the present
invention is the use of EIF4E as an added indicator of gene expression of certain genes [e.g.,
cyclinD1, mdm2, VEGF, and others]. For example, in two tissue specimens containing the same
amount of normalized VEGF mRNA, it is likely that the tissue containing the higher normalized
level of EIF4E exhibits the greater level of VEGF gene expression.

The background is as follows. A key point in the regulation of mRNA translation is
selection of mRNAs by the EIF4G complex to bind to the 43S ribosomal subunit. The protein
EIF4E [the m7G CAP-binding protein] is often limiting because more mRNAs than EIF4E
copies exist in cells. Highly structured 5'UTRs or highly GC-rich ones are inefficiently
translated, and these often code for genes that carry out functions relevant to cancer [e.g.,
cyclin D1, mdm2, and VEGF]. EIF4E is itself regulated at the transcriptional/mRNA level.
Thus, expression of EIF4E provides added indication of increased activity of a number of
proteins.

It is also noteworthy that overexpression of EIF4E transforms cultured cells, and hence EIF4E
is an oncogene. Overexpression of EIF4E occurs in several different types of carcinomas but is
particularly significant in breast cancer. EIF4E is typically expressed at very low levels in
normal breast tissue.

D. Prognostic Indicators

1. DNA Repair Enzymes 

Loss of BRCA1 or BRCA2 activity via mutation represents the critical oncogenic step in
the most common type[s] of familial breast cancer. The levels of mRNAs of these important
enzymes are abnormal in subsets of sporadic breast cancer as well. Loss of signals from either
[to within the lower ten percentile] heightens risk of short survival.

2. Cell Cycle Regulators

Cell cycle regulators include 14 genes: c-MYC; c-Src; Cyclin D1; Ha-Ras; mdm2;
p14ARF; p21WAF1/CIP; p16INK4a/p14; p23; p27; p53; PI3K; PKC-epsilon; PKC-delta.

The gene for p53 [TP53] is mutated in a large fraction of breast cancers. Frequently p53
levels are elevated when loss of function mutation occurs. When the mutation is
dominant-negative, it creates survival value for the cancer cell because growth is promoted and
apoptosis is inhibited. Thousands of different p53 mutations have been found in human cancer,
and the functional consequences of many of them are not clear. A large body of academic
literature addresses the prognostic and predictive significance of mutated p53 and the results
are highly conflicting. The present invention provides a functional genomic measure of p53
activity, as follows. The activated wild type p53 molecule triggers transcription of the cell
cycle inhibitor p21. Thus, the ratio of p53 to p21 should be low when p53 is wild-type and
activated. When p53 is detectable and the ratio of p53 to p21 is elevated in tumors relative to
normal breast, it signifies nonfunctional or dominant negative p53. The cancer literature
provides evidence for this as borne out by poor prognosis.

Mdm2 is an important p53 regulator. Activated wild-type p53 stimulates transcription of
mdm2. The mdm2 protein binds p53 and promotes its proteolytic destruction. Thus, abnormally
low levels of mdm2 in the presence of normal or higher levels of p53 indicate that p53 is mutated
and inactivated.

One aspect of the present invention is the use of ratios of mRNA levels p53:p21 and
p53:mdm2 to provide a picture of p53 status. Evidence for dominant negative mutation of p53
(as indicated by high p53:p21 and/or high p53:mdm2 mRNA ratios, specifically in the upper ten
percentile) presages higher risk of recurrence in breast cancer and therefore weighs toward a
decision to use chemotherapy in node negative post surgery breast cancer.
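The ratio-based readout described above can be illustrated with a minimal sketch. It assumes that expression values have already been converted to normalized relative mRNA levels (not raw CT values). The function names, the sample data, and the nearest-rank percentile convention are assumptions made for illustration and are not taken from the specification.

```python
def percentile_cutoff(values, pct):
    """Nearest-rank percentile for an integer pct (1-100)."""
    ordered = sorted(values)
    # Integer ceiling of pct * n / 100, kept within the valid index range.
    rank = max(1, (pct * len(ordered) + 99) // 100)
    return ordered[rank - 1]


def flag_high_ratio(numerator, denominator, pct=90):
    """Return (ratios, flags); a flag is True when a ratio lies above the pct-th percentile."""
    ratios = [n / d for n, d in zip(numerator, denominator)]
    cutoff = percentile_cutoff(ratios, pct)
    return ratios, [r > cutoff for r in ratios]


# Hypothetical normalized mRNA levels for ten tumors (arbitrary units).
p53 = [1.0, 1.2, 0.9, 1.1, 4.0, 1.0, 0.8, 1.3, 1.0, 1.1]
p21 = [1.0, 1.1, 1.0, 1.2, 0.5, 0.9, 1.0, 1.1, 1.2, 1.0]

ratios, flags = flag_high_ratio(p53, p21)
```

The same helper could be applied to p53 and mdm2 levels to obtain the p53:mdm2 flag; a tumor flagged by either ratio would then weigh toward the dominant negative interpretation given in the text.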

Another important cell cycle regulator is p27, which in the activated form blocks cell
cycle progression at the level of cdk4. The protein is regulated primarily via
phosphorylation/dephosphorylation, rather than at the transcriptional level. However, levels of
p27 mRNAs do vary. Therefore a level of p27 mRNA in the upper ten percentile indicates
reduced risk of recurrence of breast cancer post surgery.

Cyclin D1 is a principal positive regulator of entry into S phase of the cell cycle. The gene
for cyclin D1 is amplified in about 20% of breast cancer patients, and therefore promotes tumor
growth in those cases. One aspect of the present invention is use of cyclin D1
mRNA levels for diagnostic purposes in breast cancer. A level of cyclin D1 mRNA in the upper
ten percentile suggests high risk of recurrence in breast cancer following surgery and suggests
particular benefit of adjuvant chemotherapy.

3. Other tumor suppressors and related proteins 

These include APC and E-cadherin. It has long been known that the tumor suppressor
APC is lost in about 50% of colon cancers, with concomitant transcriptional upregulation of
E-cadherin, an important cell adhesion molecule and growth suppressor. Recently, it has been
found that the APC gene is silenced in 15-40% of breast cancers. Likewise, the E-cadherin gene is
silenced [via CpG island methylation] in about 30% of breast cancers. An abnormally low level
of APC and/or E-cadherin mRNA, in the lower 5 percentile, suggests high risk of recurrence in
breast cancer following surgery and heightened risk of shortened survival.

4. Regulators of Apoptosis

These include Bcl/Bax family members Bcl2, Bcl-xl, Bak, Bax and related factors,
NFk-B and related factors, and also p53BP1/ASPP1 and p53BP2/ASPP2.

Bax and Bak are pro-apoptotic and Bcl2 and Bcl-xl are anti-apoptotic. Therefore, the
ratios of these factors influence the resistance or sensitivity of a cell to toxic (pro-apoptotic)
drugs. In breast cancer, unlike other cancers, elevated level of Bcl2 (in the upper ten percentile)
correlates with good outcome. This reflects the fact that Bcl2 has growth inhibitory activity as
well as anti-apoptotic activity, and in breast cancer the significance of the former activity
outweighs the significance of the latter. The impact of Bcl2 is in turn dependent on the status of
the growth stimulating transcription factor c-MYC. The gene for c-MYC is amplified in about
20% of breast cancers. When c-MYC message levels are abnormally elevated relative to Bcl2
(such that this ratio is in the upper ten percentile), then elevated level of Bcl2 mRNA is no
longer a positive indicator.

NFk-B is another important anti-apoptotic factor. Originally recognized as a
pro-inflammatory transcription factor, it is now clear that it prevents programmed cell death in
response to several extracellular toxic factors [such as tumor necrosis factor]. The activity of this
transcription factor is regulated principally via phosphorylation/dephosphorylation events.
However, levels of NFk-B nevertheless do vary from cell to cell, and elevated levels should
correlate with increased resistance to apoptosis. Importantly for present purposes, NFk-B exerts
its anti-apoptotic activity largely through its stimulation of transcription of mRNAs encoding
certain members of the IAP [inhibitor of apoptosis] family of proteins, specifically cIAP1,
cIAP2, XIAP, and Survivin. Thus, abnormally elevated levels of mRNAs for these IAPs and for
NFk-B [any in the upper 5 percentile] signify activation of the NFk-B anti-apoptotic pathway.
This suggests high risk of recurrence in breast cancer following chemotherapy and therefore poor
prognosis. One embodiment of the present invention is the inclusion in the gene set of the above
apoptotic regulators, and the above-outlined use of combinations and ratios of the levels of their
mRNAs for prognosis in breast cancer.

The proteins p53BP1 and p53BP2 bind to p53 and promote transcriptional activation of
pro-apoptotic genes. The levels of p53BP1 and p53BP2 are suppressed in a significant fraction of
breast cancers, correlating with poor prognosis. When either is expressed in the lower tenth
percentile, poor prognosis is indicated.

5. Factors that control cell invasion and angiogenesis 
These include uPA, PAI1, cathepsins B, G and L, scatter factor [HGF], c-met, KDR,
VEGF, and CD31. The plasminogen activator uPA and its serpin regulator PAI1 promote
breakdown of extracellular matrices and tumor cell invasion. Abnormally elevated levels of both
mRNAs in malignant breast tumors (in the upper twenty percentile) signify an increased risk of
shortened survival, increased recurrence in breast cancer patients post surgery, and increased
importance of receiving adjuvant chemotherapy. On the other hand, node negative patients
whose tumors do not express elevated levels of these mRNA species are less likely to have
recurrence of this cancer and could more seriously consider whether the benefits of standard
chemotherapy justify the associated toxicity.

Cathepsins B or L, when expressed in the upper ten percentile, predict poor disease-free
and overall survival. In particular, cathepsin L predicts short survival in node positive patients.

Scatter factor and its cognate receptor c-met promote cell motility and invasion, cell
growth, and angiogenesis. In breast cancer, elevated levels of mRNAs encoding these factors
should prompt aggressive treatment with chemotherapeutic drugs when expression of either, or
the combination, is above the 90th percentile.

VEGF is a central positive regulator of angiogenesis, and elevated levels in solid tumors
predict short survival [note many references showing that elevated level of VEGF predicts short
survival]. Inhibitors of VEGF therefore slow the growth of solid tumors in animals and humans.
VEGF activity is controlled at the level of transcription. VEGF mRNA levels in the upper ten
percentile indicate significantly worse than average prognosis. Other markers of vascularization,
CD31 [PECAM] and KDR, indicate high vessel density in tumors and that the tumor will be
particularly malignant and aggressive, and hence that an aggressive therapeutic strategy is
warranted.

6. Markers for Immune and Inflammatory Cells and Processes 
These markers include the genes for Immunoglobulin light chain lambda, CD18, CD3, CD68,
Fas [CD95], and Fas Ligand.

Several lines of evidence suggest that the mechanisms of action of certain drugs used in
breast cancer entail activation of the host immune/inflammatory response (for example,
Herceptin®). One aspect of the present invention is the inclusion in the gene set of markers for
inflammatory and immune cells, and markers that predict tumor resistance to immune
surveillance. Immunoglobulin light chain lambda is a marker for immunoglobulin producing
cells. CD18 is a marker for all white cells. CD3 is a marker for T-cells. CD68 is a marker for
macrophages.

CD95 and Fas ligand are a receptor:ligand pair that mediates one of two major pathways
by which cytotoxic T cells and NK cells kill targeted cells. Decreased expression of CD95 and
increased expression of Fas Ligand indicate poor prognosis in breast cancer. Both CD95 and
Fas Ligand are transmembrane proteins, and need to be membrane anchored to trigger cell death.
Certain tumor cells produce a truncated soluble variant of CD95, created as a result of alternative
splicing of the CD95 mRNA. This blocks NK cell and cytotoxic T cell Fas Ligand-mediated
killing of the tumor cells. Presence of soluble CD95 correlates with poor survival in breast
cancer. The gene set includes both soluble and full-length variants of CD95.

7. Cell proliferation markers 

The gene set includes the cell proliferation markers Ki67/MIB1, PCNA, Pin1, and
thymidine kinase. High levels of expression of proliferation markers associate with high
histologic grade and short survival. Levels of thymidine kinase in the upper ten percentile
suggest increased risk of short survival. Pin1 is a prolyl isomerase that stimulates cell growth, in
part through the transcriptional activation of the cyclin D1 gene, and levels in the upper ten
percentile contribute to a negative prognostic profile.

8. Other growth factors and receptors 

This gene set includes IGF1, IGF2, IGFBP3, IGF1R, FGF2, FGFR1, CSF-1R/fms, CSF-1,
IL6 and IL8. All of these proteins are expressed in breast cancer. Most stimulate tumor growth.
However, expression of the growth factor FGF2 correlates with good outcome. Some have
anti-apoptotic activity, prominently IGF1. Activation of the IGF1 axis via elevated IGF1, IGF1R, or
IGFBP3 (as indicated by the sum of these signals in the upper ten percentile) inhibits tumor cell
death and strongly contributes to a poor prognostic profile.

9. Gene expression markers that define subclasses of breast cancer 
These include: GRO1 oncogene alpha, Grb7, cytokeratins 5 and 17, retinol binding
protein 4, hepatocyte nuclear factor 3, integrin alpha 7, and lipoprotein lipase. These markers
subset breast cancer into different cell types that are phenotypically different at the level of gene
expression. Tumors expressing signals for Bcl2, hepatocyte nuclear factor 3, LIV1 and ER above
the mean have the best prognosis for disease free and overall survival following surgical removal
of the cancer. Another category of breast cancer tumor type, characterized by elevated
expression of lipoprotein lipase, retinol binding protein 4, and integrin alpha 7, carries
intermediate prognosis. Tumors expressing either elevated levels of cytokeratins 5 and 17, GRO
oncogene at levels four-fold or greater above the mean, or ErbB2 and Grb7 at levels ten-fold or
more above the mean, have the worst prognosis.

Although throughout the present description, including the Examples below, various
aspects of the invention are explained with reference to gene expression studies, the invention
can be performed in a similar manner, and similar results can be reached by applying proteomics
techniques that are well known in the art. The proteome is the totality of the proteins present in a
sample (e.g. tissue, organism, or cell culture) at a certain point of time. Proteomics includes,
among other things, study of the global changes of protein expression in a sample (also referred
to as "expression proteomics"). Proteomics typically includes the following steps: (1) separation
of individual proteins in a sample by 2-D gel electrophoresis (2-D PAGE); (2) identification of
the individual proteins recovered from the gel, e.g. by mass spectrometry and/or N-terminal
sequencing; and (3) analysis of the data using bioinformatics. Proteomics methods are valuable
supplements to other methods of gene expression profiling, and can be used, alone or in
combination with other methods of the present invention, to detect the products of the gene
markers of the present invention.

Further details of the invention will be described in the following non-limiting Examples. 
Example 1 

Isolation of RNA from formalin-fixed, paraffin-embedded (FPET) tissue specimens
A. Protocols

I. EPICENTRE® Xylene Protocol
RNA Isolation

(1) Cut 1-6 sections (each 10 µm thick) of paraffin-embedded tissue per sample using
a clean microtome blade and place into a 1.5 ml eppendorf tube.

(2) To extract paraffin, add 1 ml of xylene and invert the tubes for 10 minutes by
rocking on a nutator.

(3) Pellet the sections by centrifugation for 10 minutes at 14,000 x g in an eppendorf
microcentrifuge.

(4) Remove the xylene, leaving some in the bottom to avoid dislodging the pellet.

(5) Repeat steps 2-4.

(6) Add 1 ml of 100% ethanol and invert for 3 minutes by rocking on the nutator.

(7) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf
microcentrifuge.

(8) Remove the ethanol, leaving some at the bottom to avoid dislodging the pellet.

(9) Repeat steps 6-8 twice.

(10) Remove all of the remaining ethanol.

(11) For each sample, add 2 µl of 50 µg/µl Proteinase K to 300 µl of Tissue and Cell
Lysis Solution.

(12) Add 300 µl of Tissue and Cell Lysis Solution containing the Proteinase K to each
sample and mix thoroughly.

(13) Incubate at 65 °C for 90 minutes (vortex mixing every 5 minutes). Visually
monitor the remaining tissue fragment. If still visible after 30 minutes, add an additional 2 µl of
50 µg/µl Proteinase K and continue incubating at 65 °C until fragment dissolves.

(14) Place the samples on ice for 3-5 minutes and proceed with protein removal and
total nucleic acid precipitation.
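As a check on the reagent quantities in step (11), the working Proteinase K concentration follows from simple dilution arithmetic. The sketch below is illustrative only; the helper name is hypothetical and the calculation assumes the volumes are additive.

```python
def final_concentration_ug_per_ml(stock_ug_per_ul, stock_volume_ul, diluent_volume_ul):
    """Concentration in µg/ml after adding a stock volume to a diluent volume."""
    total_mass_ug = stock_ug_per_ul * stock_volume_ul
    total_volume_ml = (stock_volume_ul + diluent_volume_ul) / 1000.0
    return total_mass_ug / total_volume_ml


# Step (11): 2 µl of 50 µg/µl Proteinase K added to 300 µl of lysis solution.
conc = final_concentration_ug_per_ml(50, 2, 300)
print(round(conc))  # about 331 µg/ml
```

This is close to the roughly 330 µg/ml Proteinase K specified for the lysis buffer of the hot wax/urea protocol described later in this Example.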
Protein Removal and Precipitation of Total Nucleic Acid

(1) Add 150 µl of MPC Protein Precipitation Reagent to each lysed sample and
vortex vigorously for 10 seconds.

(2) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf
microcentrifuge.

(3) Transfer the supernatant into clean eppendorf tubes and discard the pellet.

(4) Add 500 µl of isopropanol to the recovered supernatant and thoroughly mix by
rocking on the nutator for 3 minutes.

(5) Pellet the RNA/DNA by centrifugation at 4 °C for 10 minutes at 14,000 x g in an
eppendorf microcentrifuge.

(6) Remove all of the isopropanol with a pipet, being careful not to dislodge the
pellet.

Removal of Contaminating DNA from RNA Preparations

(1) Prepare 200 µl of DNase I solution for each sample by adding 5 µl of RNase-Free
DNase I (1 U/µl) to 195 µl of 1X DNase Buffer.

(2) Completely resuspend the pelleted RNA in 200 µl of DNase I solution by
vortexing.

(3) Incubate the samples at 37 °C for 60 minutes.

(4) Add 200 µl of 2X T and C Lysis Solution to each sample and vortex for 5
seconds.

(5) Add 200 µl of MPC Protein Precipitation Reagent, mix by vortexing for 10
seconds and place on ice for 3-5 minutes.

(6) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf
microcentrifuge.

(7) Transfer the supernatant containing the RNA to clean eppendorf tubes and discard
the pellet. (Be careful to avoid transferring the pellet.)

(8) Add 500 µl of isopropanol to each supernatant and rock samples on the nutator for
3 minutes.

(9) Pellet the RNA by centrifugation at 4 °C for 10 minutes at 14,000 x g in an
eppendorf microcentrifuge.

(10) Remove the isopropanol, leaving some at the bottom to avoid dislodging the
pellet.

(11) Rinse twice with 1 ml of 75% ethanol. Centrifuge briefly if the RNA pellet is
dislodged.

(12) Remove ethanol carefully.

(13) Set under fume hood for about 3 minutes to remove residual ethanol.

(14) Resuspend the RNA in 30 µl of TE Buffer and store at -30 °C.

II. Hot Wax/Urea Protocol of the Invention
RNA Isolation

(1) Cut 3 sections (each 10 µm thick) of paraffin-embedded tissue using a clean
microtome blade and place into a 1.5 ml eppendorf tube.

(2) Add 300 µl of lysis buffer (10 mM Tris 7.5, 0.5% sodium lauroyl sarcosine, 0.1
mM EDTA pH 7.5, 4 M urea) containing 330 µg/ml Proteinase K (added freshly from a 50 µg/µl
stock solution) and vortex briefly.

(3) Incubate at 65 °C for 90 minutes (vortex mixing every 5 minutes). Visually
monitor the tissue fragment. If still visible after 30 minutes, add an additional 2 µl of 50 µg/µl
Proteinase K and continue incubating at 65 °C until fragment dissolves.

(4) Centrifuge for 5 minutes at 14,000 x g and transfer upper aqueous phase to new
tube, being careful not to disrupt the paraffin seal.

(5) Place the samples on ice for 3-5 minutes and proceed with protein removal and
total nucleic acid precipitation.

Protein Removal and Precipitation of Total Nucleic Acid

(1) Add 150 µl of 7.5 M NH4OAc to each lysed sample and vortex vigorously for 10
seconds.

(2) Pellet the debris by centrifugation for 10 minutes at 14,000 x g in an eppendorf
microcentrifuge.

(3) Transfer the supernatant into clean eppendorf tubes and discard the pellet.

(4) Add 500 µl of isopropanol to the recovered supernatant and thoroughly mix by
rocking on the nutator for 3 minutes.

(5) Pellet the RNA/DNA by centrifugation at 4 °C for 10 minutes at 14,000 x g in an
eppendorf microcentrifuge.

(6) Remove all of the isopropanol with a pipet, being careful not to dislodge the
pellet.

Removal of Contaminating DNA from RNA Preparations

(1) Add 45 µl of 1X DNase I buffer (10 mM Tris-Cl, pH 7.5, 2.5 mM MgCl2, 0.1 mM
CaCl2) and 5 µl of RNase-Free DNase I (2 U/µl, Ambion) to each sample.

(2) Incubate the samples at 37 °C for 60 minutes.

(3) Inactivate the DNase I by heating at 70 °C for 5 minutes.

B. Results

Experimental evidence demonstrates that the hot RNA extraction protocol of the
invention does not compromise RNA yield. Using 19 FPE breast cancer specimens, extracting
RNA from three adjacent sections in the same specimens, RNA yields were measured via
capillary electrophoresis with fluorescence detection (Agilent Bioanalyzer). Average RNA yields
in nanograms and standard deviations with the invented and commercial methods, respectively,
were: 139 +/- 21 versus 141 +/- 34.

Also, it was found that the urea-containing lysis buffer of the present invention can be
substituted for the EPICENTRE® T&C lysis buffer, and the 7.5 M NH4OAc reagent used for
protein precipitation in accordance with the present invention can be substituted for the
EPICENTRE® MPC protein precipitation solution with neither significant compromise of RNA
yield nor TaqMan® efficiency.
Example 2 

Amplification of mRNA Species Prior to RT-PCR

The method described in section 10 above was used with RNA isolated from fixed,
paraffin-embedded breast cancer tissue. TaqMan® analyses were performed with first strand
cDNA generated with the T7-GSP primer (unamplified (T7-GSPr)) and with T7 amplified RNA
(amplified (T7-GSPr)). RNA was amplified according to step 2 of Figure 4. As a control,
TaqMan® was also performed with cDNA generated with an unmodified GSPr (amplified
(GSPr)). An equivalent amount of initial template (1 ng/well) was used in each TaqMan®
reaction.

The results are shown in Figure 8. In vitro transcription increased RT-PCR signal
intensity by more than 10 fold, and for certain genes by more than 100 fold, relative to controls in
which the RT-PCR primers were the same primers used in method 2 for the generation of
double-stranded DNA for in vitro transcription (GSP-T7, and GSPr). Also shown in Figure 8 are
RT-PCR data generated when standard optimized RT-PCR primers (i.e., lacking T7 tails) were
used. As shown, compared to this control, the new method yielded substantial increases in
RT-PCR signal (from 4 to 64 fold in this experiment).
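The fold increases quoted above relate to threshold-cycle (CT) differences in the usual way for real-time PCR. The following minimal sketch assumes perfect doubling per cycle (100% amplification efficiency), which is an assumption not stated in the text; the function names are hypothetical.

```python
import math


def fold_change_from_delta_ct(delta_ct, efficiency=1.0):
    """Fold increase in signal implied by a CT decrease of delta_ct cycles.

    efficiency=1.0 means the product doubles every cycle (assumed here).
    """
    return (1.0 + efficiency) ** delta_ct


def delta_ct_from_fold_change(fold, efficiency=1.0):
    """CT decrease corresponding to a given fold increase in signal."""
    return math.log(fold) / math.log(1.0 + efficiency)


low = delta_ct_from_fold_change(4)    # about 2 cycles
high = delta_ct_from_fold_change(64)  # about 6 cycles
```

Under this assumption, the reported 4 to 64 fold range corresponds to CT values that are roughly 2 to 6 cycles earlier than with the standard primers.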

The new method requires that each T7-GSP sequence be optimized so that the increase in
the RT-PCR signal is the same for each gene, relative to the standard optimized RT-PCR (with
non-T7 tailed primers).
Example 3 

A Study of Gene Expression in Premalignant and Malignant Breast Tumors

A gene expression study was designed and conducted with the primary goal to
molecularly characterize gene expression in paraffin-embedded, fixed tissue samples of invasive
breast ductal carcinoma, and to explore the correlation between such molecular profiles and
disease-free survival. A further objective of the study was to compare the molecular profiles in
tissue samples of invasive breast cancer with the molecular profiles obtained in ductal carcinoma
in situ. The study was further designed to obtain data on the molecular profiles in lobular
carcinoma in situ and in paraffin-embedded, fixed tissue samples of invasive lobular carcinoma.

Molecular assays were performed on paraffin-embedded, formalin-fixed primary breast
tumor tissues obtained from 202 individual patients diagnosed with breast cancer. All patients
underwent surgery with diagnosis of invasive ductal carcinoma of the breast, pure ductal
carcinoma in situ (DCIS), lobular carcinoma of the breast, or pure lobular carcinoma in situ
(LCIS). Patients were included in the study only if histopathologic assessment, performed as
described in the Materials and Methods section, indicated adequate amounts of tumor tissue and
homogeneous pathology.

The individuals participating in the study were divided into the following groups:

Group 1: Pure ductal carcinoma in situ (DCIS); n=18

Group 2: Invasive ductal carcinoma; n=130

Group 3: Pure lobular carcinoma in situ (LCIS); n=7

Group 4: Invasive lobular carcinoma; n=16

Materials and Methods

Each representative tumor block was characterized by standard histopathology for
diagnosis, semi-quantitative assessment of amount of tumor, and tumor grade. A total of 6
sections (10 microns in thickness each) were prepared and placed in two Costar Brand
Microcentrifuge Tubes (Polypropylene, 1.7 mL tubes, clear; 3 sections in each tube). If the
tumor constituted less than 30% of the total specimen area, the sample may have been crudely
dissected by the pathologist, using gross microdissection, putting the tumor tissue directly into
the Costar tube.

If more than one tumor block was obtained as part of the surgical procedure, all tumor
blocks were subjected to the same characterization, as described above, and the block most
representative of the pathology was used for analysis.
Gene Expression Analysis 

mRNA was extracted and purified from fixed, paraffin-embedded tissue samples, and
prepared for gene expression analysis as described in chapters 7-11 above.

Molecular assays of quantitative gene expression were performed by RT-PCR, using the
ABI PRISM 7900™ Sequence Detection System™ (Perkin-Elmer-Applied Biosystems, Foster
City, CA, USA). ABI PRISM 7900™ consists of a thermocycler, laser, charge-coupled device
(CCD) camera, and computer. The system amplifies samples in a 384-well format on a
thermocycler. During amplification, laser-induced fluorescent signal is collected in real-time
through fiber optics cables for all 384 wells, and detected at the CCD. The system includes
software for running the instrument and for analyzing the data.

Analysis and Results 

Tumor tissue was analyzed for 185 cancer-related genes and 7 reference genes. The
threshold cycle (CT) values for each patient were normalized based on the median of all genes
for that particular patient. Clinical outcome data were available for all patients from a review of
registry data and selected patient charts.
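The per-patient normalization described above can be sketched as follows. The text does not give the exact formula, so this example assumes the simplest reading, in which each gene's CT is expressed relative to the median CT of all genes measured for that patient. The function name and the sample values are hypothetical.

```python
from statistics import median


def normalize_ct(patient_ct):
    """Center one patient's CT values on that patient's median CT.

    patient_ct maps gene name -> raw CT; the result maps gene name -> CT minus median.
    """
    center = median(patient_ct.values())
    return {gene: ct - center for gene, ct in patient_ct.items()}


# Hypothetical raw CT values for one patient.
raw = {"FOXM1": 33.0, "Bcl2": 28.0, "Ki-67": 30.0, "PR": 29.0, "CA9": 35.0}
normalized = normalize_ct(raw)
```

Centering on the median makes values comparable between patients whose samples differ in overall RNA quantity or quality; a constant offset could be added afterwards without changing any between-group comparison.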

Outcomes were classified as:

0 died due to breast cancer or to unknown cause or alive with breast cancer
recurrence;

1 alive without breast cancer recurrence or died due to a cause other than breast
cancer.
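The two outcome classes defined above can be expressed as a small function. This is only a sketch of the stated definitions; the argument names and the string labels for cause of death are hypothetical.

```python
def classify_outcome(alive, recurrence, death_cause=None):
    """Return 0 or 1 following the two outcome classes defined above.

    alive: True if the patient is alive at last follow-up.
    recurrence: True if breast cancer has recurred (consulted for living patients).
    death_cause: "breast cancer", "unknown", or another cause (consulted for deceased patients).
    """
    if alive:
        return 0 if recurrence else 1
    # Deceased: breast cancer or unknown cause is class 0, any other cause is class 1.
    if death_cause in ("breast cancer", "unknown", None):
        return 0
    return 1
```

For example, `classify_outcome(True, False)` gives 1, while `classify_outcome(False, False, "unknown")` gives 0.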

Analysis was performed by:

1. Analysis of the relationship between normalized gene expression and the binary
outcomes of 0 or 1.

2. Analysis of the relationship between normalized gene expression and the time to
outcome (0 or 1 as defined above) where patients who were alive without breast cancer
recurrence or who died due to a cause other than breast cancer were censored. This approach
was used to evaluate the prognostic impact of individual genes and also sets of multiple genes.
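For the second (time-to-outcome) approach, each patient contributes a follow-up time and a flag saying whether the event was observed or censored. The sketch below shows only that encoding; the text does not name the survival model used, and the function name, the time unit (months), and the sample data are assumptions.

```python
def to_survival_record(follow_up_months, outcome):
    """Encode one patient for time-to-event analysis.

    Outcome 0 (breast cancer death, unknown-cause death, or recurrence) is an observed event.
    Outcome 1 (alive without recurrence, or death from another cause) is censored.
    Returns (time, event_observed).
    """
    if outcome not in (0, 1):
        raise ValueError("outcome must be 0 or 1")
    return follow_up_months, outcome == 0


# Hypothetical (follow-up time, outcome class) pairs.
records = [to_survival_record(t, o) for t, o in [(14, 0), (60, 1), (37, 0), (72, 1)]]
events = sum(1 for _, observed in records if observed)
```

Records in this form are the usual input to survival methods that handle censoring.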

Analysis of 146 patients with invasive breast carcinoma by binary approach

In the first (binary) approach, analysis was performed on all 146 patients with invasive
breast carcinoma. A t test was performed on the groups of patients classified as 0 or 1 and the
p-values for the differences between the groups for each gene were calculated.

The following Table 4 lists the 45 genes for which the p-value for the differences between
the groups was <0.05.

Table 4

Gene/SEQ ID NO:   Mean CT Alive   Mean CT Deceased   t-value       Degrees of freedom   p
FOXM1             [illegible]     [illegible]        3.92          144                  0.0001
PRAME             [illegible]     [illegible]        [illegible]   144                  0.0001
Bcl2              [illegible]     [illegible]        [illegible]   144                  [illegible]
STK15             [illegible]     [illegible]        [illegible]   144                  [illegible]
CEGP1             29.12           30.86              -3.39         144                  [illegible]
Ki-67             30.57           29.62              3.34          144                  0.0011
GSTM1             30.62           31.63              -3.27         144                  [illegible]
CA9               34.96           33.54              3.18          144                  0.0018
PR                29.56           31.22              -3.16         144                  0.0019
BBC3              31.54           32.10              -3.10         144                  0.0023
NME1              27.31           26.68              3.04          144                  [illegible]
SURV              31.64           30.68              2.92          144                  0.0041
GATA3             26.06           26.99              [illegible]   144                  [illegible]
TFRC              28.96           28.48              [illegible]   144                  [illegible]
YB-1              [illegible]     [illegible]        [illegible]   144                  [illegible]
DPYD              [illegible]     [illegible]        [illegible]   144                  0.0054
GSTM3             [illegible]     [illegible]        [illegible]   144                  [illegible]
RPS6KB1           [illegible]     [illegible]        [illegible]   144                  [illegible]
Src               [illegible]     [illegible]        [illegible]   144                  [illegible]
Chk1              [illegible]     [illegible]        [illegible]   144                  [illegible]
ID1               [illegible]     [illegible]        [illegible]   144                  [illegible]
EstR1             [illegible]     [illegible]        [illegible]   144                  0.0160
p27               27.15           27.51              -2.41         144                  0.0174
CCNB1             31.63           30.87              2.40          144                  0.0176
XIAP              30.27           30.51              -2.40         144                  0.0178
Chk2              31.48           31.11              2.39          144                  0.0179
CDC25B            29.75           29.39              2.37          144                  0.0193
IGF1R             28.85           29.44              -2.34         144                  0.0209
AK055699          33.23           [illegible]        [illegible]   144                  0.0242
PI3KC2A           31.07           [illegible]        [illegible]   144                  0.0257
TGFB3             [illegible]     [illegible]        [illegible]   144                  0.0258
BAGI1             [illegible]     [illegible]        [illegible]   144                  0.0269
CYP3A4            [illegible]     [illegible]        [illegible]   144                  0.0317
EpCAM             [illegible]     [illegible]        [illegible]   144                  0.0321
VEGFC             [illegible]     [illegible]        [illegible]   144                  0.0326
pS2               [illegible]     [illegible]        [illegible]   144                  0.0341
hENT1             27.19           [illegible]        [illegible]   144                  [illegible]
WISP1             31.20           [illegible]        [illegible]   144                  [illegible]
HNF3A             27.89           [illegible]        [illegible]   144                  [illegible]
NFKBp65           33.22           [illegible]        [illegible]   144                  [illegible]
BRCA2             33.06           32.62              [illegible]   144                  [illegible]
EGFR              [illegible]     30.13              2.06          144                  0.0414
TK1               32.27           31.72              2.02          144                  0.0453
VDR               30.08           29.73              1.99          144                  0.0488

In the foregoing Table 4, lower (negative) t-values indicate higher expression (or lower
CTs) associated with better outcomes, and, inversely, higher (positive) t-values indicate higher
expression (lower CTs) associated with worse outcomes. Thus, for example, elevated expression
of the FOXM1 gene (t-value = 3.92, CT mean alive > CT mean deceased) indicates a reduced
likelihood of disease free survival. Similarly, elevated expression of the CEGP1 gene (t-value =
-3.39; CT mean alive < CT mean deceased) indicates an increased likelihood of disease free
survival.
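The sign convention just described can be made concrete with a short sketch of a two-sample t statistic. It assumes a pooled-variance (Student) t test, which is consistent with the 144 degrees of freedom reported for 146 patients but is not stated explicitly in the text. The function names and the sample CT values are hypothetical, and the p-value is not computed here; a statistics package (for example SciPy's `scipy.stats.ttest_ind`) would supply it.

```python
import math
from statistics import mean, variance


def pooled_t(alive_ct, deceased_ct):
    """Student's two-sample t statistic (equal variances assumed) and degrees of freedom.

    Sign convention as in Table 4: positive when mean CT is higher in the alive group,
    that is, when expression is higher (CT lower) in the deceased group.
    """
    n1, n2 = len(alive_ct), len(deceased_ct)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * variance(alive_ct) + (n2 - 1) * variance(deceased_ct)) / df
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean(alive_ct) - mean(deceased_ct)) / standard_error, df


def direction(t_value):
    """Outcome associated with higher expression, judged from the sign of t."""
    if t_value > 0:
        return "worse"
    if t_value < 0:
        return "better"
    return "none"


# Hypothetical normalized CT values for a single gene.
alive = [31.0, 32.0, 33.0, 34.0]
deceased = [30.0, 31.0, 32.0]
t_value, df = pooled_t(alive, deceased)
```

Applied to the values quoted above, `direction(3.92)` returns "worse" (FOXM1) and `direction(-3.39)` returns "better" (CEGP1).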

Based on the data set forth in Table 4, the overexpression of any of the following genes in
breast cancer indicates a reduced likelihood of survival without cancer recurrence following
surgery: FOXM1; PRAME; STK15; Ki-67; CA9; NME1; SURV; TFRC; YB-1; RPS6KB1; Src;
Chk1; CCNB1; Chk2; CDC25B; CYP3A4; EpCAM; VEGFC; hENT1; BRCA2; EGFR; TK1;
VDR.

Based on the data set forth in Table 4, the overexpression of any of the following genes in
breast cancer indicates a better prognosis for survival without cancer recurrence following
surgery: Bcl2; CEGP1; GSTM1; PR; BBC3; GATA3; DPYD; GSTM3; ID1; EstR1; p27; XIAP;
IGF1R; AK055699; PI3KC2A; TGFB3; BAGI1; pS2; WISP1; HNF3A; NFKBp65.

Analysis of 108 ER positive patients by binary approach

108 patients with normalized CT for estrogen receptor (ER) < 25.2 (i.e., ER positive
patients) were subjected to separate analysis. A t test was performed on the groups of
patients classified as 0 or 1 and the p-values for the differences between the groups for
each gene were calculated. The following Table 5 lists the 12 genes where the p-value
for the differences between the groups was <0.05.

Table 5

Gene/         Mean CT    Mean CT     t-value    Degrees of    p
SEQ ID NO:    Alive      Deceased               freedom

PRAME         35.54      33.88        3.03      106           0.0031
Bcl2          28.24      28.87       -2.70      106           0.0082
FOXM1         33.82      32.85        2.66      106           0.0089
DIABLO        30.33      30.71       -2.47      106           0.0153
EPHX1         28.62      28.03        2.44      106           0.0163
HIF1A         29.37      28.88        2.40      106           0.0180
VEGFC         32.3?      31.69        2.39      106           0.0187
Ki-67         30.73      29.82        2.38      106           0.0191
IGF1R         28.60      29.18       -2.37      106           0.0194
VDR           30.14      29.60        2.17      106           0.0322
NME1          27.34      26.80        2.03      106           0.0452
GSTM3         28.08      28.92       -2.00      106           0.0485

For each gene, a classification algorithm was utilized to identify the best threshold value 

(CT) for using each gene alone in predicting clinical outcome. 
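
The text does not say which classification algorithm was used. As one possible illustration, the Python sketch below assumes an exhaustive search over candidate CT cut points that maximizes classification accuracy against the binary outcome labels (0 or 1); the function name and the data are hypothetical.

```python
def best_ct_threshold(ct_values, outcomes):
    """Search CT cut points for the single-gene rule with the highest accuracy.

    The rule is: predict `label_above` when CT > threshold, otherwise the other label.
    Returns (threshold, accuracy, label_above), or None if fewer than two distinct CTs.
    """
    distinct = sorted(set(ct_values))
    # Candidate cut points: midpoints between consecutive distinct CT values.
    candidates = [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
    best = None
    for threshold in candidates:
        for label_above in (0, 1):
            correct = sum(
                (outcome == label_above) if ct > threshold else (outcome != label_above)
                for ct, outcome in zip(ct_values, outcomes)
            )
            accuracy = correct / len(outcomes)
            if best is None or accuracy > best[1]:
                best = (threshold, accuracy, label_above)
    return best


# Hypothetical normalized CT values and binary outcome labels, for illustration only.
cts = [28.0, 29.0, 30.0, 31.0]
labels = [1, 1, 0, 0]
result = best_ct_threshold(cts, labels)
print(result)
```

A search of this kind fits the threshold to the same data it is scored on, so in practice the chosen cut point would need to be checked on independent samples.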

Based on the data set forth in Table 5, overexpression of the following genes in ER-positive
cancer is indicative of a reduced likelihood of survival without cancer recurrence
following surgery: PRAME; FOXM1; EPHX1; HIF1A; VEGFC; Ki-67; VDR; NME1. Some of
these genes (PRAME; FOXM1; VEGFC; Ki-67; VDR; and NME1) were also identified as
indicators of poor prognosis in the previous analysis, not limited to ER-positive breast cancer.
The overexpression of the remaining genes (EPHX1 and HIF1A) appears to be a negative
indicator of disease free survival in ER-positive breast cancer only. Based on the data set forth in
Table 5, overexpression of the following genes in ER-positive cancer is indicative of a better


prognosis for survival without cancer recurrence following surgery: Bcl-2; DIABLO; IGF1R;
GSTM3. Of the latter genes, Bcl-2; IGF1R; and GSTM3 have also been identified as indicators
of good prognosis in the previous analysis, not limited to ER-positive breast cancer. The
overexpression of DIABLO appears to be a positive indicator of disease free survival in
ER-positive breast cancer only.

Analysis of multiple genes and indicators of outcome

Two approaches were taken in order to determine whether using multiple genes would
provide better discrimination between outcomes.

First, a discrimination analysis was performed using a forward stepwise approach.
Models were generated that classified outcome with greater discrimination than was obtained
with any single gene alone.

According to a second approach (time-to-event approach), for each gene a Cox
Proportional Hazards model (see, e.g., Cox, D. R., and Oakes, D. (1984), Analysis of Survival
Data, Chapman and Hall, London, New York) was defined with time to recurrence or death as
the dependent variable, and the expression level of the gene as the independent variable. The
genes that have a p-value < 0.05 in the Cox model were identified. For each gene, the Cox
model provides the relative risk (RR) of recurrence or death for a unit change in the expression of
the gene. One can choose to partition the patients into subgroups at any threshold value of the
measured expression (on the CT scale), where all patients with expression values above the
threshold have higher risk, and all patients with expression values below the threshold have
lower risk, or vice versa, depending on whether the gene is an indicator of good (RR>1.01) or
poor (RR<1.01) prognosis. Thus, any threshold value will define subgroups of patients with
respectively increased or decreased risk. The results are summarized in the following Tables 6
and 7.
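
To make the two ideas in this paragraph concrete, the following Python sketch shows (i) how a per-unit relative risk from a Cox model scales with the size of the expression change, since the model is log-linear in the covariate, and (ii) how a CT threshold splits patients into two subgroups. The helper names, patient identifiers and CT values are hypothetical, and fitting the Cox model itself is not shown.

```python
def relative_risk_for_change(rr_per_unit, delta_units):
    """Relative risk for a change of `delta_units`, given the per-unit RR.

    In a Cox model RR = exp(coef) per unit, so a change of d units
    gives exp(coef * d) = RR ** d.
    """
    return rr_per_unit ** delta_units


def partition_by_threshold(ct_by_patient, threshold):
    """Split patient IDs into those with CT above the threshold and the rest."""
    above = sorted(pid for pid, ct in ct_by_patient.items() if ct > threshold)
    at_or_below = sorted(pid for pid, ct in ct_by_patient.items() if ct <= threshold)
    return above, at_or_below


# Hypothetical per-unit RR and normalized CT values, for illustration only.
print(relative_risk_for_change(0.5, 2))
cts = {"P1": 27.9, "P2": 29.4, "P3": 28.6}
above, at_or_below = partition_by_threshold(cts, 28.5)
print(above, at_or_below)
```

Which of the two subgroups carries the higher risk depends on the direction of the gene's RR, as described above.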

Table 6

Cox Model Results for 146 Patients with Invasive Breast Cancer

Gene           Relative Risk (RR)    SE Relative Risk    p value

FOXM1          0.58                  0.15                0.0002
STK15          0.51                  0.20                0.0006
PRAME          0.78                  0.07                0.0007
Bcl2           1.66                  0.15                0.0009
CEGP1          1.25                  0.07                0.0014
GSTM1          1.40                  0.11                0.0014
Ki67           0.62                  0.15                0.0016
PR             1.23                  0.07                0.0017
Contig51037    0.81                  0.07                0.0022
NME1           0.64                  0.15                0.0023
YB-1           0.39                  0.32                0.0033
TFRC           0.53                  0.21                0.0035
BBC3           1.72                  0.19                0.0036
GATA3          1.32                  0.10                0.0039
CA9            0.81                  0.0?                0.0049
SURV           0.69                  0.13                0.0049
DPYD           2.58                  0.34                0.0052
RPS6KB1        0.60                  0.18                0.0055
GSTM3          1.36                  0.12                0.0078
Src.2          0.39                  0.36                0.0094
TGFB3          1.61                  0.19                0.0109
CDC25B         0.54                  0.25                0.0122
XIAP           3.20                  0.47                0.0126
CCNB1          0.68                  0.16                0.0151
IGF1R          1.42                  0.15                0.0153
Chk1           0.68                  0.16                0.0155
ID1            1.80                  0.25                0.0164
p27            1.69                  0.22                0.0168
Chk2           0.52                  0.27                0.0175
EstR1          1.1?                  0.0?                0.0196

HNF3A          1.?1                  0.08                0.0206
pS2            1.1?                  0.0?                0.0230
BAGI1          1.??                  0.29                0.0266
AK055699       1.24                  [illegible]         0.0276
pENT1          [illegible]           0.31                0.0293
EpCAM          [illegible]           0.22                0.0310
WISP1          1.39                  0.1?                0.0338
VEGFC          0.??                  0.23                0.0364
TK1            0.73                  0.1?                0.0382
NFKBp65        1.32                  0.14                0.0384
BRCA2          0.66                  0.20                0.0404
CYP3A4         0.60                  0.25                0.0417
EGFR           0.72                  0.16                0.0436



Table 7

Cox Model Results for 108 Patients with ER+ Invasive Breast Cancer

Gene           Relative Risk (RR)    SE Relative Risk    p-value

PRAME          0.75                  0.10                0.0045
Contig51037    0.75                  0.11                0.0060
Bcl2           2.11                  0.28                0.0075
HIF1A          0.42                  0.34                0.0117
IGF1R          1.92                  0.26                0.0117
FOXM1          0.54                  0.24                0.0119
EPHX1          0.43                  0.33                0.0120
Ki67           0.60                  0.21                0.0160
CDC25B         0.41                  0.38                0.0200
VEGFC          0.45                  0.37                0.0288
CTSB           0.32                  0.53                0.0328
DIABLO         2.91                  0.50                0.0328
p27            1.83                  0.28                0.0341
CDH1           0.57                  0.27                0.0352
IGFBP3         0.45                  0.40                0.0499

The binary and time-to-event analyses, with few exceptions, identified the same genes as
prognostic markers. For example, comparison of Tables 4 and 6 shows that, with the exception
of a single gene, the two analyses generated the same list of top 15 markers (as defined by the
smallest p values). Furthermore, when both analyses identified the same gene, they were
concordant with respect to the direction (positive or negative sign) of the correlation with
survival/recurrence. Overall, these results strengthen the conclusion that the identified markers
have significant prognostic value.

For Cox models comprising more than two genes (multivariate models), stepwise entry
of each individual gene into the model is performed, where the first gene entered is pre-selected
from among those genes having significant univariate p-values, and the gene selected for entry
into the model at each subsequent step is the gene that best improves the fit of the model to the
data. This analysis can be performed with any total number of genes. In the analysis the results 
of which are shown below, stepwise entry was performed for 

Multivariate analysis is performed using the following equation:

RR = exp[coef(geneA) x Ct(geneA) + coef(geneB) x Ct(geneB) + coef(geneC) x
Ct(geneC) + ...]

In this equation, coefficients for genes that are predictors of beneficial outcome are
positive numbers and coefficients for genes that are predictors of unfavorable outcome are
negative numbers. The "Ct" values in the equation are ΔCts, i.e., they reflect the difference between
the average normalized Ct value for a population and the normalized Ct measured for the patient
in question. The convention used in the present analysis has been that ΔCts below and above the
population average have positive signs and negative signs, respectively (reflecting greater or
lesser mRNA abundance). The relative risk (RR) calculated by solving this equation will
indicate if the patient has an enhanced or reduced chance of long-term survival without cancer
recurrence.
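
As a minimal sketch of this calculation, the Python code below evaluates the equation for a hypothetical two-gene model. The coefficients, population means and patient Ct values are invented for illustration and are not taken from the analysis; the sign of each ΔCt follows the convention stated above (a patient Ct below the population average gives a positive value).

```python
import math


def delta_ct(population_mean_ct, patient_ct):
    """Difference from the population average; positive when the patient Ct is lower."""
    return population_mean_ct - patient_ct


def relative_risk(coefficients, population_mean_ct, patient_ct):
    """RR = exp[sum over genes of coef(gene) x delta Ct(gene)]."""
    linear_predictor = sum(
        coef * delta_ct(population_mean_ct[gene], patient_ct[gene])
        for gene, coef in coefficients.items()
    )
    return math.exp(linear_predictor)


# Hypothetical model and measurements, for illustration only.
coefficients = {"geneA": 0.5, "geneB": -0.25}
population_mean_ct = {"geneA": 30.0, "geneB": 28.0}
patient_ct = {"geneA": 29.0, "geneB": 30.0}
rr = relative_risk(coefficients, population_mean_ct, patient_ct)
print(round(rr, 3))
```

In this example geneA contributes 0.5 x 1.0 and geneB contributes -0.25 x -2.0, so the exponent is 1.0.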

Multivariate gene analysis of 147 patients with invasive breast carcinoma

(a) A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was
performed on the gene expression data obtained for all 147 patients with invasive breast
carcinoma. Genes CEGP1, FOXM1, STK15 and PRAME were excluded from this analysis.
The following ten-gene sets have been identified by this analysis as having particularly strong
predictive value of patient survival without cancer recurrence following surgical removal of
primary tumor.

1. Bcl2, cyclinG1, NFKBp65, NME1, EPHX1, TOP2B, DR5, TERC, Src, DIABLO;

2. Ki67, XIAP, hENT1, TS, CD9, p27, cyclinG1, pS2, NFKBp65, CYP3A4;

3. GSTM1, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, NFKBp65, ErbB3;

4. PR, NME1, XIAP, upa, cyclinG1, Contig51037, TERC, EPHX1, ALDH1A3, CTSL;

5. CA9, NME1, TERC, cyclinG1, EPHX1, DPYD, Src, TOP2B, NFKBp65, VEGFC;

6. TFRC, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, ErbB3, NFKBp65.

(b) A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was
performed on the gene expression data obtained for all 147 patients with invasive breast
carcinoma, using an interrogation set including a reduced number of genes. The following
ten-gene sets have been identified by this analysis as having particularly strong predictive value of
patient survival without cancer recurrence following surgical removal of primary tumor.

1. Bcl2, PRAME, cyclinG1, FOXM1, NFKBp65, TS, XIAP, Ki67, CYP3A4, p27;

2. FOXM1, cyclinG1, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, p27, NFKBp65;

3. PRAME, FOXM1, cyclinG1, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, NFKBp65;

4. Ki67, XIAP, PRAME, hENT1, contig51037, TS, CD9, p27, ErbB3, cyclinG1;

5. STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1;

6. GSTM1, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, ID1, PLAUR;

7. PR, PRAME, NME1, XIAP, PLAUR, cyclinG1, Contig51037, TERC, EPHX1, DR5;

8. CA9, FOXM1, cyclinG1, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, p27;

9. TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NME1, VEGFC;

10. CEGP1, PRAME, hENT1, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, ID1, TS.

Multivariate analysis of patients with ER positive invasive breast carcinoma

A multivariate stepwise analysis, using the Cox Proportional Hazards Model, was
performed on the gene expression data obtained for patients with ER positive invasive breast
carcinoma. The following ten-gene sets have been identified by this analysis as having
particularly strong predictive value of patient survival without cancer recurrence following
surgical removal of primary tumor.

1. PRAME, p27, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;

2. Contig51037, EPHX1, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;

3. Bcl2, hENT1, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;

4. HIF1A, PRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;

5. IGF1R, PRAME, EPHX1, Contig51037, cyclinG1, Bcl2, NME1, PTEN, TBP, TIMP2;

6. FOXM1, Contig51037, VEGFC, TBP, HIF1A, DPYD, RAD51C, DCR3, cyclinG1, BAG1;

7. EPHX1, Contig51037, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;

8. Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;

9. CDC25B, Contig51037, hENT1, Bcl2, HLAG, TERC, NME1, upa, ID1, CYP;

10. VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;

11. CTSB, PRAME, p27, IGFBP2, EPHX1, CTSL, BAD, DR5, DCR3, XIAP;

12. DIABLO, Ki67, hENT1, TIMP2, ID1, p27, KRT19, IGFBP2, TS, PDGFB;

13. p27, PRAME, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;

14. CDH1, PRAME, VEGFC, HIF1A, DPYD, TIMP2, CYP3A4, EstR1, RBP4, p27;

15. IGFBP3, PRAME, p27, Bcl2, XIAP, EstR1, Ki67, TS, Src, VEGF;

16. GSTM3, PRAME, p27, IGFBP3, XIAP, FGF2, hENT1, PTEN, EstR1, APC;

17. hENT1, Bcl2, FOXM1, Contig51037, CyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;

18. STK15, VEGFC, PRAME, p27, GCLC, hENT1, ID1, TIMP2, EstR1, MCP1;

19. NME1, PRAM, p27, IGFBP3, XIAP, PTEN, hENT1, Bcl2, CYP3A4, HLAG;

20. VDR, Bcl2, p27, hENT1, p53, PI3KC2A, EIF4E, TFRC, MCM3, ID1;

21. EIF4E, Contig51037, EPHX1, cyclinG1, Bcl2, DR5, TBP, PTEN, NME1, HER2;

22. CCNB1, PRAME, VEGFC, HIF1A, hENT1, GCLC, TIMP2, ID1, p27, upa;

23. ID1, PRAME, DIABLO, hENT1, p27, PDGFRa, NME1, BIN1, BRCA1, TP;

24. FBX05, PRAME, IGFBP3, p27, GSTM3, hENT1, XIAP, FGF2, TS, PTEN;

25. GUS, HIA1A, VEGFC, GSTM3, DPYD, hENT1, FBX05, CA9, CYP, KRT18;

26. Bclx, Bcl2, hENT1, Contig51037, HLAG, CD9, ID1, BRCA1, BIN1, HBEGF.

It is noteworthy that many of the foregoing gene sets include genes that alone did not
have sufficient predictive value to qualify as prognostic markers under the standards discussed
above, but in combination with other genes, their presence provides valuable information about
the likelihood of long-term patient survival without cancer recurrence.

All references cited throughout the disclosure are hereby expressly incorporated by
reference.

While the present invention has been described with reference to what are considered to
be the specific embodiments, it is to be understood that the invention is not limited to such
embodiments. To the contrary, the invention is intended to cover various modifications and
equivalents included within the spirit and scope of the appended claims. For example, while the
disclosure focuses on the identification of various breast cancer associated genes and gene sets,
and on the diagnosis and treatment of breast cancer, similar genes, gene sets and methods
concerning other types of cancer are specifically within the scope herein.

WHAT IS CLAIMED IS:

1. A method for predicting clinical outcome for a patient diagnosed with cancer,
comprising

determining the expression level of one or more genes, or their expression products,
selected from the group consisting of p53BP2, cathepsin B, cathepsin L, Ki67/MiB1, and
thymidine kinase in a cancer tissue obtained from the patient, normalized against a control gene
or genes, and compared to the amount found in a reference cancer tissue set,

wherein a poor outcome is predicted if:

(a) the expression level of p53BP2 is in the lower 10th percentile; or
(b) the expression level of either cathepsin B or cathepsin L is in the upper 10th
percentile; or
(c) the expression level of either Ki67/MiB1 or thymidine kinase is in the upper
10th percentile.

2. The method of claim 1 wherein poor clinical outcome is measured in terms of
shortened survival or increased risk of cancer recurrence.

3. The method of claim 2 wherein poor clinical outcome is measured in terms of
shortened survival or increased risk of cancer recurrence following surgical removal of the
cancer.

4. The method of claim 1 wherein the cancer is selected from the group consisting of
breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer,
pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the
urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer.

5. The method of claim 4 wherein the cancer is breast cancer. 

6. The method of claim 5 wherein the expression level of p53BP2 is determined.

7. The method of claim 5 wherein the expression levels of cathepsin B and cathepsin
L are determined.

8. The method of claim 5 wherein the expression level of cathepsin L is determined.

9. The method of claim 5 wherein the expression levels of Ki67/MiB1 and
thymidine kinase are determined.

10. The method of claim 5 wherein the expression level of Ki67/MiB1 is determined.

11. The method of claim 5 wherein the expression level of thymidine kinase is
determined.

12. The method of claim 1 wherein the expression level of more than one gene, or
gene product, is determined.

13. The method of claim 1 wherein the expression level of more than two genes is
determined.

14. The method of claim 13 further comprising the step of subjecting the expression
data to multivariate analysis using the Cox Proportional Hazards model.

15. The method of claim 1 wherein the expression level is determined using RNA
obtained from a formalin-fixed, paraffin-embedded tissue sample.

16. The method of claim 1 wherein the expression level is determined by reverse
phase polymerase chain reaction (RT-PCR).

17. The method of claim 16 wherein said RNA is fragmented.

18. A method of predicting the likelihood of the recurrence of cancer following
treatment in a cancer patient, comprising determining the expression level of p27, or its
expression product, in a cancer tissue obtained from said patient, normalized against a control
gene or genes, and compared to the amount found in a reference cancer tissue set, wherein an
expression level in the upper 10th percentile indicates decreased risk of recurrence following
treatment.

19. The method of claim 18 wherein the cancer is selected from the group consisting
of breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer,
pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the
urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer.

20. The method of claim 19 wherein the cancer is breast cancer.

21. The method of claim 20 wherein the expression level is determined following
surgical removal of cancer.

22. The method of claim 20 wherein the expression level is determined using RNA
obtained from a formalin-fixed, paraffin-embedded tissue sample.

23. The method of claim 22 wherein said RNA is fragmented.

24. The method of claim 22 wherein the expression level is determined by reverse
phase polymerase chain reaction (RT-PCR).

25. A method for classifying cancer comprising determining the expression level of
two or more genes selected from the group consisting of Bcl2, hepatocyte nuclear factor 3, ER,
ErbB2 and Grb7, or their expression products, in a cancer tissue, normalized against a control
gene or genes, and compared to the amount found in a reference cancer tissue set, wherein (i)
tumors expressing at least one of Bcl2, hepatocyte nuclear factor 3, and ER, or their expression
products, above the mean expression level in the reference tissue set are classified as having a
good prognosis for disease free and overall patient survival following treatment; and (ii) tumors
expressing elevated levels of ErbB2 and Grb7, or their expression products, at levels ten-fold or
more above the mean expression level in the reference tissue set are classified as having poor
prognosis of disease free and overall patient survival following treatment.

26. The method of claim 25 wherein the cancer is selected from the group consisting
of breast cancer, colon cancer, lung cancer, prostate cancer, hepatocellular cancer, gastric cancer,
pancreatic cancer, cervical cancer, ovarian cancer, liver cancer, bladder cancer, cancer of the
urinary tract, thyroid cancer, renal cancer, carcinoma, melanoma, and brain cancer.

27. The method of claim 26 wherein the cancer is breast cancer.

28. The method of claim 26 wherein the expression level is determined following
surgical removal of cancer.

29. The method of claim 26 wherein the expression level is determined using RNA
obtained from a formalin-fixed, paraffin-embedded tissue sample.

30. The method of claim 29 wherein said RNA is fragmented.

31. The method of claim 29 wherein the expression level is determined by reverse
phase polymerase chain reaction (RT-PCR).

32. A method of predicting the likelihood of long-term survival of a breast cancer
patient without the recurrence of breast cancer, following surgical removal of the primary tumor,
comprising determining the expression level of one or more prognostic RNA transcripts or their
product in a breast cancer tissue sample obtained from said patient, normalized against the
expression level of all RNA transcripts or their products in said breast cancer tissue sample, or of
a reference set of RNA transcripts or their products, wherein the prognostic transcript is the
transcript of one or more genes selected from the group consisting of: FOXM1, PRAME, Bcl2,
STK15, CEGP1, Ki-67, GSTM1, CA9, PR, BBC3, NME1, SURV, GATA3, TFRC, YB-1,
DPYD, GSTM3, RPS6KB1, Src, Chk1, ID1, EstR1, p27, CCNB1, XIAP, Chk2, CDC25B,
IGF1R, AK055699, PI3KC2A, TGFB3, BAGI1, CYP3A4, EpCAM, VEGFC, pS2, hENT1,
WISP1, HNF3A, NFKBp65, BRCA2, EGFR, TK1, VDR, Contig51037, pENT1, EPHX1, IF1A,
DIABLO, CDH1, HIF1a, IGFBP3, CTSB, and Her2, wherein overexpression of one or more of
FOXM1, PRAME, STK15, Ki-67, CA9, NME1, SURV, TFRC, YB-1, RPS6KB1, Src, Chk1,
CCNB1, Chk2, CDC25B, CYP3A4, EpCAM, VEGFC, hENT1, BRCA2, EGFR, TK1, VDR,
EPHX1, IF1A, Contig51037, CDH1, HIF1a, IGFBP3, CTSB, Her2, and pENT1 indicates a
decreased likelihood of long-term survival without breast cancer recurrence, and the
overexpression of one or more of Bcl2, CEGP1, GSTM1, PR, BBC3, GATA3, DPYD, GSTM3,
ID1, EstR1, p27, XIAP, IGF1R, AK055699, PI3KC2A, TGFB3, BAGI1, pS2, WISP1, HNF3A,
NFKBp65, and DIABLO indicates an increased likelihood of long-term survival without breast
cancer recurrence.

33. The method of claim 32 comprising determining the expression level of at least
two of said prognostic transcripts or their expression products.

34. The method of claim 32 wherein the breast cancer is invasive breast carcinoma,
comprising determination of the expression levels of the transcripts of the following genes, or
their expression products: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, PR,
BBC3, NME1, SURV, GATA3, TFRC, YB-1, DPYD, CA9, Contig51037, RPS6K1 and Her2.

35. The method of claim 32 wherein said breast cancer is characterized by
overexpression of the estrogen receptor (ER).

36. The method of claim 35 comprising determination of the expression levels of the
transcripts of at least two of the following genes, or their expression products: PRAME, Bcl2,
FOXM1, DIABLO, EPHX1, HIF1A, VEGFC, Ki-67, IGF1R, VDR, NME1, GSTM3,
Contig51037, CDC25B, CTSB, p27, CDH1, and IGFBP3.

37. The method of claim 32 wherein the expression level of one or more prognostic
RNA transcripts is determined.

38. The method of claim 37 wherein said RNA is isolated from a fixed, wax-embedded
breast cancer tissue specimen of said patient.

39. An array comprising polynucleotides hybridizing to the following genes:
FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, PR, BBC3, NME1, SURV, GATA3,
TFRC, YB-1, DPYD, CA9, Contig51037, RPS6K1 and Her2, immobilized on a solid surface.

40. The array of claim 39 comprising polynucleotides hybridizing to the following
genes: FOXM1, PRAME, Bcl2, STK15, CEGP1, Ki-67, GSTM1, CA9, PR, BBC3, NME1,
SURV, GATA3, TFRC, YB-1, DPYD, GSTM3, RPS6KB1, Src, Chk1, ID1, EstR1, p27,
CCNB1, XIAP, Chk2, CDC25B, IGF1R, AK055699, PI3KC2A, TGFB3, BAGI1, CYP3A4,
EpCAM, VEGFC, pS2, hENT1, WISP1, HNF3A, NFKBp65, BRCA2, EGFR, TK1, VDR,
Contig51037, pENT1, EPHX1, IF1A, CDH1, HIF1a, IGFBP3, CTSB, Her2 and DIABLO,
immobilized on a solid surface.

41. A method of predicting the likelihood of long-term survival of a patient
diagnosed with invasive breast cancer, without the recurrence of breast cancer, following surgical
removal of the primary tumor, comprising the steps of:

(1) determining the expression levels of the RNA transcripts or the expression
products of genes of a gene set selected from the group consisting of

(a) Bcl2, cyclinG1, NFKBp65, NME1, EPHX1, TOP2B, DR5, TERC, Src, DIABLO;
(b) Ki67, XIAP, hENT1, TS, CD9, p27, cyclinG1, pS2, NFKBp65, CYP3A4;
(c) GSTM1, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, NFKBp65, ErbB3;
(d) PR, NME1, XIAP, upa, cyclinG1, Contig51037, TERC, EPHX1, ALDH1A3, CTSL;
(e) CA9, NME1, TERC, cyclinG1, EPHX1, DPYD, Src, TOP2B, NFKBp65, VEGFC;
(f) TFRC, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, ErbB3, NFKBp65;
(g) Bcl2, PRAME, cyclinG1, FOXM1, NFKBp65, TS, XIAP, Ki67, CYP3A4, p27;
(h) FOXM1, cyclinG1, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, p27, NFKBp65;
(i) PRAME, FOXM1, cyclinG1, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, NFKBp65;
(j) Ki67, XIAP, PRAME, hENT1, contig51037, TS, CD9, p27, ErbB3, cyclinG1;
(k) STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1;
(l) GSTM1, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, ID1, PLAUR;
(m) PR, PRAME, NME1, XIAP, PLAUR, cyclinG1, Contig51037, TERC, EPHX1, DR5;
(n) CA9, FOXM1, cyclinG1, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, p27;
(o) TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NME1, VEGFC; and
(p) CEGP1, PRAME, hENT1, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, ID1, TS

in a breast cancer tissue sample obtained from said patient, normalized against the
expression levels of all RNA transcripts or their products in said breast cancer tissue sample, or
of a reference set of RNA transcripts or their products;

(2) subjecting the data obtained in step (1) to statistical analysis; and

(3) determining whether the likelihood of said long-term survival has
increased or decreased.

42. A method of predicting the likelihood of long-term survival of a patient
diagnosed with estrogen receptor (ER)-positive invasive breast cancer, without the recurrence of
breast cancer, following surgical removal of the primary tumor, comprising the steps of:

(1) determining the expression levels of the RNA transcripts or the expression
products of genes of a gene set selected from the group consisting of

(a) PRAME, p27, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(b) Contig51037, EPHX1, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(c) Bcl2, hENT1, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(d) HIF1A, PRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(e) IGF1R, PRAME, EPHX1, Contig51037, cyclinG1, Bcl2, NME1, PTEN, TBP, TIMP2;
(f) FOXM1, Contig51037, VEGFC, TBP, HIF1A, DPYD, RAD51C, DCR3, cyclinG1, BAG1;
(g) EPHX1, Contig51037, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;
(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(i) CDC25B, Contig51037, hENT1, Bcl2, HLAG, TERC, NME1, upa, ID1, CYP;
(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;
(k) CTSB, PRAME, p27, IGFBP2, EPHX1, CTSL, BAD, DR5, DCR3, XIAP;
(l) DIABLO, Ki67, hENT1, TIMP2, ID1, p27, KRT19, IGFBP2, TS, PDGFB;
(m) p27, PRAME, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;
(n) CDH1, PRAME, VEGFC, HIF1A, DPYD, TIMP2, CYP3A4, EstR1, RBP4, p27;
(o) IGFBP3, PRAME, p27, Bcl2, XIAP, EstR1, Ki67, TS, Src, VEGF;
(p) GSTM3, PRAME, p27, IGFBP3, XIAP, FGF2, hENT1, PTEN, EstR1, APC;
(q) hENT1, Bcl2, FOXM1, Contig51037, CyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;
(r) STK15, VEGFC, PRAME, p27, GCLC, hENT1, ID1, TIMP2, EstR1, MCP1;
(s) NME1, PRAM, p27, IGFBP3, XIAP, PTEN, hENT1, Bcl2, CYP3A4, HLAG;
(t) VDR, Bcl2, p27, hENT1, p53, PI3KC2A, EIF4E, TFRC, MCM3, ID1;
(u) EIF4E, Contig51037, EPHX1, cyclinG1, Bcl2, DR5, TBP, PTEN, NME1, HER2;
(v) CCNB1, PRAME, VEGFC, HIF1A, hENT1, GCLC, TIMP2, ID1, p27, upa;
(w) ID1, PRAME, DIABLO, hENT1, p27, PDGFRa, NME1, BIN1, BRCA1, TP;
(x) FBX05, PRAME, IGFBP3, p27, GSTM3, hENT1, XIAP, FGF2, TS, PTEN;
(y) GUS, HIA1A, VEGFC, GSTM3, DPYD, hENT1, FBX05, CA9, CYP, KRT18; and
(z) Bclx, Bcl2, hENT1, Contig51037, HLAG, CD9, ID1, BRCA1, BIN1, HBEGF;

(2) subjecting the data obtained in step (1) to statistical analysis; and

(3) determining whether the likelihood of said long-term survival has
increased or decreased.

43. The method of claim 41 or claim 42 wherein said statistical analysis is performed 
by using the Cox Proportional Hazards model. 

44. An array comprising polynucleotides hybridizing to a gene set selected from the
group consisting of

(a) Bcl2, cyclinG1, NFKBp65, NME1, EPHX1, TOP2B, DR5, TERC, Src, DIABLO;
(b) Ki67, XIAP, hENT1, TS, CD9, p27, cyclinG1, pS2, NFKBp65, CYP3A4;
(c) GSTM1, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, NFKBp65, ErbB3;
(d) PR, NME1, XIAP, upa, cyclinG1, Contig51037, TERC, EPHX1, ALDH1A3, CTSL;
(e) CA9, NME1, TERC, cyclinG1, EPHX1, DPYD, Src, TOP2B, NFKBp65, VEGFC;
(f) TFRC, XIAP, Ki67, TS, cyclinG1, p27, CYP3A4, pS2, ErbB3, NFKBp65;
(g) Bcl2, PRAME, cyclinG1, FOXM1, NFKBp65, TS, XIAP, Ki67, CYP3A4, p27;
(h) FOXM1, cyclinG1, XIAP, Contig51037, PRAME, TS, Ki67, PDGFRa, p27, NFKBp65;
(i) PRAME, FOXM1, cyclinG1, XIAP, Contig51037, TS, Ki6, PDGFRa, p27, NFKBp65;
(j) Ki67, XIAP, PRAME, hENT1, contig51037, TS, CD9, p27, ErbB3, cyclinG1;
(k) STK15, XIAP, PRAME, PLAUR, p27, CTSL, CD18, PREP, p53, RPS6KB1;
(l) GSTM1, XIAP, PRAME, p27, Contig51037, ErbB3, GSTp, EREG, ID1, PLAUR;
(m) PR, PRAME, NME1, XIAP, PLAUR, cyclinG1, Contig51037, TERC, EPHX1, DR5;
(n) CA9, FOXM1, cyclinG1, XIAP, TS, Ki67, NFKBp65, CYP3A4, GSTM3, p27;
(o) TFRC, XIAP, PRAME, p27, Contig51037, ErbB3, DPYD, TERC, NME1, VEGFC; and
(p) CEGP1, PRAME, hENT1, XIAP, Contig51037, ErbB3, DPYD, NFKBp65, ID1, TS,

immobilized on a solid surface.

45. An array comprising polynucleotides hybridizing to a gene set selected from the 
group consisting of

(a) PRAME, p27, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;

(b) Contig51037, EPHX1, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;

(c) Bcl2, hENT1, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;

(d) HIF1A, PRAME, p27, IGFBP2, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;


(e) IGF1R, PRAME, EPHX1, Contig51037, cyclinG1, Bcl2, NME1, PTEN, TBP, TIMP2;

(f) FOXM1, Contig51037, VEGFC, TBP, HIF1A, DPYD, RAD51C, DCR3, cyclinG1, BAG1;

(g) EPHX1, Contig51037, Ki67, TIMP2, cyclinG1, DPYD, CYP3A4, TP, AIB1, CYP2C8;

(h) Ki67, VEGFC, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;

(i) CDC25B, Contig51037, hENT1, Bcl2, HLAG, TERC, NME1, upa, ID1, CYP;

(j) VEGFC, Ki67, VDR, GSTM3, p27, upa, ITGA7, rhoC, TERC, Pin1;

(k) CTSB, PRAME, p27, IGFBP2, EPHX1, CTSL, BAD, DR5, DCR3, XIAP;

(l) DIABLO, Ki67, hENT1, TIMP2, ID1, p27, KRT19, IGFBP2, TS, PDGFB;

(m) p27, PRAME, IGFBP2, HIF1A, TIMP2, ILT2, CYP3A4, ID1, EstR1, DIABLO;

(n) CDH1, PRAME, VEGFC, HIF1A, DPYD, TIMP2, CYP3A4, EstR1, RBP4, p27;

(o) IGFBP3, PRAME, p27, Bcl2, XIAP, EstR1, Ki67, TS, Src, VEGF;

(p) GSTM3, PRAME, p27, IGFBP3, XIAP, FGF2, hENT1, PTEN, EstR1, APC;

(q) hENT1, Bcl2, FOXM1, Contig51037, cyclinG1, Contig46653, PTEN, CYP3A4, TIMP2, AREG;

(r) STK15, VEGFC, PRAME, p27, GCLC, hENT1, ID1, TIMP2, EstR1, MCP1;

(s) NME1, PRAME, p27, IGFBP3, XIAP, PTEN, hENT1, Bcl2, CYP3A4, HLAG;

(t) VDR, Bcl2, p27, hENT1, p53, PI3KC2A, EIF4E, TFRC, MCM3, ID1;

(u) EIF4E, Contig51037, EPHX1, cyclinG1, Bcl2, DR5, TBP, PTEN, NME1, HER2;

(v) CCNB1, PRAME, VEGFC, HIF1A, hENT1, GCLC, TIMP2, ID1, p27, upa;

(w) ID1, PRAME, DIABLO, hENT1, p27, PDGFRa, NME1, BIN1, BRCA1, TP;

(x) FBXO5, PRAME, IGFBP3, p27, GSTM3, hENT1, XIAP, FGF2, TS, PTEN;

(y) GUS, HIF1A, VEGFC, GSTM3, DPYD, hENT1, FBXO5, CA9, CYP, KRT18; and

(z) Bclx, Bcl2, hENT1, Contig51037, HLAG, CD9, ID1, BRCA1, BDM1, HBEGF,
immobilized on a solid surface.

SEQUENCE LISTING

<110> GENOMIC HEALTH
Baker, Joffre B.
Cronin, Maureen T.
Kiefer, Michael C.
Shak, Steve
Walker, Michael Graham

<120> GENE EXPRESSION PROFILING IN BIOPSIED TUMOR TISSUES

<130> 39740-0001PCT

<140> to be assigned
<141> 2003-03-12

<150> US 60/412,049
<151> 2002-09-18

<150> US 60/364,890
<151> 2002-03-13

<160> 384

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 18
<212> DNA
<213> Homo sapiens
<400> 1
gtcccaggag cccatcct 18

<210> 2
<211> 19
<212> DNA
<213> Homo sapiens
<400> 2
cccggctgtt gtctccata 19

<210> 3
<211> 68
<212> DNA
<213> Homo sapiens
<400> 3
gtcccaggag cccatcctgt ttgactgcag cattgctgag aacattgcct atggagacaa 60
cagccggg 68
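
The entries in this listing appear to come in groups of three: two short sequences followed by a longer one that begins with the first and ends with the reverse complement of the second, as a forward primer, reverse primer and amplicon would. That reading is inferred from the sequences themselves and is not stated in the listing. A minimal Python sketch of such a consistency check, using SEQ ID NOs 1-3 and hypothetical helper names, could look like this:

```python
# Illustrative check only; the helper names are hypothetical and the
# primer/primer/amplicon interpretation is an assumption, not part of the listing.
COMPLEMENT = str.maketrans("acgt", "tgca")


def clean(seq: str) -> str:
    """Keep only base letters, dropping spaces and position counters."""
    return "".join(ch for ch in seq.lower() if ch in "acgtn")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def is_consistent_triplet(forward: str, reverse: str, amplicon: str) -> bool:
    """True if the amplicon starts with the forward sequence and ends with
    the reverse complement of the reverse sequence."""
    amp = clean(amplicon)
    return amp.startswith(clean(forward)) and amp.endswith(reverse_complement(reverse))


# Sequences as given for SEQ ID NOs 1-3 (first block of SEQ ID NO 3 read as gtcccaggag).
SEQ_1 = "gtcccaggag cccatcct"
SEQ_2 = "cccggctgtt gtctccata"
SEQ_3 = (
    "gtcccaggag cccatcctgt ttgactgcag cattgctgag aacattgcct atggagacaa 60 "
    "cagccggg 68"
)

print(len(clean(SEQ_3)))
print(is_consistent_triplet(SEQ_1, SEQ_2, SEQ_3))
```

The same check can be applied to later groups of three to spot transcription errors, since a mismatch at either end of the longer sequence would make it return False.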

<210> 4
<211> 18
<212> DNA
<213> Homo sapiens
<400> 4
tcatggtgcc cgtcaatg 18

<210> 5
<211> 23
<212> DNA
<213> Homo sapiens
<400> 5
cgattgtctt tgctcttcat gtg 23

<210> 6
<211> 79
<212> DNA
<213> Homo sapiens
<400> 6
tcatggtgcc cgtcaatgct gtgatggcga tgaagaccaa gacgtatcag gtggcccaca 60
tgaagagcaa agacaatcg 79

<210> 7

<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 7 

aggggatgac ttggacacat 20 

<210> 8 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 8 

aaaactgcat ggctttgtca 20 

<210> 9 
<211> 65 
<212> DNA 

<213> Homo sapiens 
<400> 9 

aggggatgac ttggacacat ctgccattcg acatgactgc aattttgaca aagccatgca 60 
gtttt 65

<210> 10 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 10 

tcatcctggc gatctacttc ct 22 

<210> 11 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 11 

ccgttgagtg gaatcagcaa 20 

<210> 12 
<211> 91 
<212> DNA 

<213> Homo sapiens 
<400> 12 

tcatcctggc gatctacttc ctctggcaga acctaggtcc ctctgtcctg gctggagtcg 60 
ctttcatggt cttgctgatt ccactcaacg g 91

<210> 13 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 13 

agcgcctgga atctacaact 20

<210> 14
<211> 20
<212> DNA
<213> Homo sapiens
<400> 14
agagcccctg gagagaagat 20

<210> 15
<211> 66
<212> DNA
<213> Homo sapiens
<400> 15
agcgcctgga atctacaact cggagtccag tgttttccca cttgtcatct tctctccagg 60
ggctct 66

<210> 16
<211> 24
<212> DNA
<213> Homo sapiens
<400> 16
gcccagagaa ggtctatgaa ctca 24

<210> 17
<211> 22
<212> DNA
<213> Homo sapiens
<400> 17
gtttcaaagg cttggtggat tt 22

<210> 18
<211> 94
<212> DNA
<213> Homo sapiens
<400> 18
gcccagagaa ggtctatgaa ctcatgcgag catgttggca gtggaatccc tctgaccggc 60
cctcctttgc tgaaatccac caagcctttg aaac 94

<210> 19
<211> 21
<212> DNA
<213> Homo sapiens
<400> 19
cgcagtgcag ctgagtatct g 21

<210> 20
<211> 21
<212> DNA
<213> Homo sapiens
<400> 20
tgcccagggc tactctcact t 21

<210> 21
<211> 80
<212> DNA
<213> Homo sapiens
<400> 21
cgcagtgcag ctgagtatct gctcagcagt ctaatcaatg gcagcttcct ggtgcgagaa 60
agtgagagta gccctgggca 80

<210> 22
<211> 21
<212> DNA
<213> Homo sapiens
<400> 22
cagcagatgt ggatcagcaa g 21

<210> 23
<211> 18
<212> DNA
<213> Homo sapiens
<400> 23
gcatttgcgg tggacgat 18

<210> 24
<211> 66
<212> DNA
<213> Homo sapiens
<400> 24
cagcagatgt ggatcagcaa gcaggagtat gacgagtccg gcccctccat cgtccaccgc 60
aaatgc 66

<210> 25
<211> 20
<212> DNA
<213> Homo sapiens
<400> 25
cgcttctatg gcgctgagat 20

<210> 26
<211> 20
<212> DNA
<213> Homo sapiens
<400> 26
tcccggtaca ccacgttctt 20

<210> 27
<211> 71
<212> DNA
<213> Homo sapiens
<400> 27
cgcttctatg gcgctgagat tgtgtcagcc ctggactacc tgcactcgga gaagaacgtg 60
gtgtaccggg a 71

<210> 28
<211> 25
<212> DNA
<213> Homo sapiens
<400> 28
ttgtctctgc cttggactat ctaca 25

<210> 29
<211> 24
<212> DNA
<213> Homo sapiens
<400> 29
ccagcattag attctccaac ttga 24

<210> 30
<211> 75
<212> DNA
<213> Homo sapiens
<400> 30
ttgtctctgc cttggactat ctacattccg gaaagattgt gtaccgtgat ctcaagttgg 60
agaatctaat gctgg 75

<210> 31
<211> 25
<212> DNA
<213> Homo sapiens
<400> 31
gaaggagata aggaggatgt tgaca 25

<210> 32
<211> 18
<212> DNA
<213> Homo sapiens
<400> 32
cgccacggag atccaatc 18

<210> 33
<211> 74
<212> DNA
<213> Homo sapiens
<400> 33
gaaggagata aggaggatgt tgacaaggca gtgaaggccg caagacaggc ttttcagatt 60
ggatctccgt ggcg 74

<210> 34
<211> 21
<212> DNA
<213> Homo sapiens
<400> 34
tggtgaacat tgtgccagga t 21

<210> 35
<211> 22
<212> DNA
<213> Homo sapiens
<400> 35
gaaggcgatc ttgttgatct ga 22

<210> 36
<211> 80
<212> DNA
<213> Homo sapiens
<400> 36
tggtgaacat tgtgccagga ttcgggccca cagtgggagc agcaatttct tctcaccctc 60
agatcaacaa gatcgccttc 80

<210> 37
<211> 20
<212> DNA
<213> Homo sapiens
<400> 37
ggacagcagg aatgtgtttc 20

<210> 38
<211> 20
<212> DNA
<213> Homo sapiens
<400> 38
acccactcga tttgtttctg 20

<210> 39
<211> 69
<212> DNA
<213> Homo sapiens
<400> 39
ggacagcagg aatgtgtttc tccatacagg tcacggggag ccaatggttc agaaacaaat 60
cgagtgggt 69

<210> 40
<211> 27
<212> DNA
<213> Homo sapiens
<400> 40
tgtgagtgaa atgccttcta gtagtga 27

<210> 41
<211> 27
<212> DNA
<213> Homo sapiens
<400> 41
ttgtggttcg ttatcatact cttctga 27

<210> 42
<211> 82
<212> DNA
<213> Homo sapiens
<400> 42
tgtgagtgaa atgccttcta gtagtgaacc gtcctcggga gccgactatg actactcaga 60
agagtatgat aacgaaccac aa 82

<210> 43
<211> 19
<212> DNA
<213> Homo sapiens
<400> 43
gtctcgctcc gtggcctta 19

<210> 44
<211> 24
<212> DNA
<213> Homo sapiens
<400> 44
cgtgagtaaa cctgaatctt tgga 24

<210> 45
<211> 93
<212> DNA
<213> Homo sapiens
<400> 45
gtctcgctcc gtggccttag ctgtgctcgc gctactctct ctttctggcc tggaggctat 60
ccagcgtact ccaaagattc aggtttactc acg 93

<210> 46
<211> 20
<212> DNA
<213> Homo sapiens
<400> 46
ccattcccac cattctacct 20

<210> 47
<211> 20
<212> DNA
<213> Homo sapiens
<400> 47
gggaacatag acccaccaat 20

<210> 48
<211> 66
<212> DNA
<213> Homo sapiens
<400> 48
ccattcccac cattctacct gaggccagga cgtctggggt gtggggattg gtgggtctat 60
gttccc 66

<210> 49
<211> 18
<212> DNA
<213> Homo sapiens
<400> 49
ccgccgtgga cacagact 18

<210> 50
<211> 21
<212> DNA
<213> Homo sapiens
<400> 50
ttgccgtcag aaaacatgtc a 21

<210> 51
<211> 70
<212> DNA
<213> Homo sapiens
<400> 51
ccgccgtgga cacagactcc ccccgagagg tctttttccg agtggcagct gacatgtttt 60
ctgacggcaa 70

<210> 52
<211> 25
<212> DNA
<213> Homo sapiens
<400> 52
cagatggacc tagtacccac tgaga 25

<210> 53
<211> 24
<212> DNA
<213> Homo sapiens
<400> 53
cctatgattt aagggcattt ttcc 24

<210> 54
<211> 73
<212> DNA
<213> Homo sapiens
<400> 54
cagatggacc tagtacccac tgagatttcc acgccgaagg acagcgatgg gaaaaatgcc 60
cttaaatcat agg 73

<210> 55 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 55 

cttttgtgga actctatggg aaca 24 

<210> 56 
<211> 19 
<212> DNA 

<213> Homo sapiens 
<400> 56 

cagcggttga agcgttcct 19 

<210> 57 
<211> 70 
<212> DNA 

<213> Homo sapiens 
<400> 57 

cttttgtgga actctatggg aacaatgcag cagccgagag ccgaaagggc caggaacgct 60 
tcaaccgctg 70 

<210> 58 

<211> 24 

<212> DNA

<213> Homo sapiens 

<400> 58 

ggatatttcc gtggctctta ttca 24 

<210> 59 
<211> 25 
<212> DNA 

<213> Homo sapiens
<400> 59 

cttctcatca aggcagaaaa atctt 25 

<210> 60 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 60 

ggatatttcc gtggctctta ttcaaactct ccatcaaatc ctgtaaactc cagagcaaat 60 
caagattttt ctgccttgat gagaag 86 

<210> 61
<211> 23
<212> DNA
<213> Homo sapiens
<400> 61
gcagttggaa gacacaggaa agt 23

<210> 62
<211> 21
<212> DNA
<213> Homo sapiens
<400> 62
tgcgtggcac tattttcaag a 21

<210> 63
<211> 77
<212> DNA
<213> Homo sapiens
<400> 63
gcagttggaa gacacaggaa agtatcccca aattgcagat ttatcaacgg cttttatctt 60
gaaaatagtg ccacgca 77

<210> 64 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 64 

tgttttgatt cccgggctta 20 

<210> 65 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 65 

caaagctgtc agctctagca aaag 24 

<210> 66 
<211> 80 
<212> DNA 

<213> Homo sapiens 

<400> 66
tgttttgatt cccgggctta ccaggtgaga agtgagggag gaagaaggca gtgtcccttt 60 
tgctagagct gacagctttg 80 

<210> 67 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 67 

tcagggggct agaaatctgt 20 

<210> 68 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 68 

ccattccagt tgatctgtgg 20 

<210> 69 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 69 

tcagggggct agaaatctgt tgctatgggc ccttcaccaa catgcccaca gatcaactgg 60 
aatgg 65 

<210> 70 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 70 

agttcgtgct ttgcaagatg 20

<210> 71
<211> 20
<212> DNA
<213> Homo sapiens
<400> 71
aaggtaagct gggtctgctg 20

<210> 72
<211> 70
<212> DNA
<213> Homo sapiens
<400> 72
agttcgtgct ttgcaagatg gtgcagagct ttatgaagca gtgaagaatg cagcagaccc 60
agcttacctt 70

<210> 73
<211> 21
<212> DNA
<213> Homo sapiens
<400> 73
gcatgttcgt ggcctctaag a 21

<210> 74
<211> 22
<212> DNA
<213> Homo sapiens
<400> 74
cggtgtagat gcacagcttc tc 22

<210> 75
<211> 69
<212> DNA
<213> Homo sapiens
<400> 75
gcatgttcgt ggcctctaag atgaaggaga ccatccccct gacggccgag aagctgtgca 60
tctacaccg 69

<210> 76
<211> 20
<212> DNA
<213> Homo sapiens
<400> 76
agatgaagtg gaaggcgctt 20

<210> 77
<211> 21
<212> DNA
<213> Homo sapiens
<400> 77
tgcctctgta atcggcaact g 21

<210> 78
<211> 65
<212> DNA
<213> Homo sapiens
<400> 78
agatgaagtg gaaggcgctt ttcaccgcgg ccatcctgca ggcacagttg ccgattacag 60
aggca 65

<210> 79
<211> 18
<212> DNA
<213> Homo sapiens
<400> 79
tggttcccag ccctgtgt 18

<210> 80
<211> 19
<212> DNA
<213> Homo sapiens
<400> 80
ctcctccacc ctgggttgt 19

<210> 81
<211> 74
<212> DNA
<213> Homo sapiens
<400> 81
tggttcccag ccctgtgtcc acctccaagc ccagattcag attcgagtca tgtacacaac 60
ccagggtgga ggag 74

<210> 82
<211> 20
<212> DNA
<213> Homo sapiens
<400> 82
tcttgctggc tacgcctctt 20

<210> 83
<211> 21
<212> DNA
<213> Homo sapiens
<400> 83
ctgcattgtg gcacagttct g 21

<210> 84
<211> 71
<212> DNA
<213> Homo sapiens
<400> 84
tcttgctggc tacgcctctt ctgtccctgt tagacgtcct ccgtccatat cagaactgtg 60
ccacaatgca g 71

<210> 85
<211> 21
<212> DNA
<213> Homo sapiens
<400> 85
tgagtgtccc ccggtatctt c 21

<210> 86
<211> 21
<212> DNA
<213> Homo sapiens
<400> 86
cagccgcttt cagattttca t 21

<210> 87
<211> 81
<212> DNA
<213> Homo sapiens
<400> 87
tgagtgtccc ccggtatctt ccccgccctg ccaatcccga tgaaattgga aattttattg 60
atgaaaatct gaaagcggct g 81

<210> 88
<211> 21
<212> DNA
<213> Homo sapiens
<400> 88
tggagactct cagggtcgaa a 21

<210> 89
<211> 22
<212> DNA
<213> Homo sapiens
<400> 89
ggcgtttgga gtggtagaaa tc 22

<210> 90
<211> 65
<212> DNA
<213> Homo sapiens
<400> 90
tggagactct cagggtcgaa aacggcggca gaccagcatg acagatttct accactccaa 60
acgcc 65

<210> 91
<211> 21
<212> DNA
<213> Homo sapiens
<400> 91
cggtggacca cgaagagtta a 21

<210> 92
<211> 19
<212> DNA
<213> Homo sapiens
<400> 92
ggctcgcctc ttccatgtc 19

<210> 93
<211> 66
<212> DNA
<213> Homo sapiens
<400> 93
cggtggacca cgaagagtta acccgggact tggagaagca ctgcagagac atggaagagg 60
cgagcc 66

<210> 94
<211> 19
<212> DNA
<213> Homo sapiens
<400> 94
gcggaaggtc cctcagaca 19

<210> 95
<211> 23
<212> DNA
<213> Homo sapiens
<400> 95
tctaagtttc ccgaggtttc tca 23

<210> 96
<211> 70
<212> DNA
<213> Homo sapiens
<400> 96
gcggaaggtc cctcagacat ccccgattga aagaaccaga gaggctctga gaaacctcgg 60
gaaacttaga 70

<210> 97
<211> 22
<212> DNA
<213> Homo sapiens
<400> 97
ccagctttgt gcctgtcact at 22

<210> 98
<211> 20
<212> DNA
<213> Homo sapiens
<400> 98
gggaatgtgg tagcccaaga 20

<210> 99
<211> 71
<212> DNA
<213> Homo sapiens
<400> 99
ccagctttgt gcctgtcact attcctcatg ccaccactgc caacacctct gtcttgggct 60
accacattcc c 71

<210> 100
<211> 27
<212> DNA
<213> Homo sapiens
<400> 100
ttgctataac taagtgcttc tccaaga 27

<210> 101
<211> 22
<212> DNA
<213> Homo sapiens
<400> 101
gtggaatggc agctcactgt ag 22

<210> 102
<211> 73
<212> DNA
<213> Homo sapiens
<400> 102
ttgctataac taagtgcttc tccaagaccc caactgagtc cccagcacct gctacagtga 60
gctgccattc cac 73

<210> 103
<211> 19
<212> DNA
<213> Homo sapiens
<400> 103
aggacgcaag gagggtttg 19

<210> 104
<211> 21
<212> DNA
<213> Homo sapiens
<400> 104
gatgtccgcc gagtccttac t 21

<210> 105 
<211> 87 
<212> DNA 

<213> Homo sapiens 
<400> 105 

aggacgcaag gagggtttgt cactggcaga ctcgagactg taggcactgc catggcccct 60 
gtgctcagta aggactcggc ggacatc 87 

<210> 106 
<211> 24 
<212> DNA 

<213> Homo sapiens 

<400> 106
ctatatgcag ccagagatgt gaca 24 

<210> 107 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 107 

ccacgagttt cttactgaga atgg 24 

<210> 108 
<211> 82 
<212> DNA 

<213> Homo sapiens 
<400> 108 

ctatatgcag ccagagatgt gacagccacc gtggacagcc tgccactcat cacagcctcc 60 
attctcagta agaaactcgt gg 82 

<210> 109 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 109 

tgtcgatgga cttccagaac 20 

<210> 110 

<211> 19 

<212> DNA

<213> Homo sapiens 

<400> 110

attgggacag cttggatca 19 

<210> 111 
<211> 62 
<212> DNA 

<213> Homo sapiens 
<400> 111 

tgtcgatgga cttccagaac cacctgggca gctgccaaaa gtgtgatcca agctgtccca 60 
at 62

<210> 112
<211> 23
<212> DNA
<213> Homo sapiens
<400> 112
gatctaagat ggcgactgtc gaa 23

<210> 113
<211> 25
<212> DNA
<213> Homo sapiens
<400> 113
ttagattccg ttttctcctc ttctg 25

<210> 114
<211> 82
<212> DNA
<213> Homo sapiens
<400> 114
gatctaagat ggcgactgtc gaaccggaaa ccacccctac tcctaatccc ccgactacag 60
aagaggagaa aacggaatct aa 82

<210> 115
<211> 20
<212> DNA
<213> Homo sapiens
<400> 115
cggtgtgaga agtgcagcaa 20

<210> 116
<211> 19
<212> DNA
<213> Homo sapiens
<400> 116
cctctcgcaa gtgctccat 19

<210> 117
<211> 70
<212> DNA
<213> Homo sapiens
<400> 117
cggtgtgaga agtgcagcaa gccctgtgcc cgagtgtgct atggtctggg catggagcac 60
ttgcgagagg 70

<210> 118
<211> 23
<212> DNA
<213> Homo sapiens
<400> 118
cggttatgtc atgccagata cac 23

<210> 119
<211> 24
<212> DNA
<213> Homo sapiens
<400> 119
gaactgagac ccactgaaga aagg 24

<210> 120
<211> 81
<212> DNA
<213> Homo sapiens
<400> 120
cggttatgtc atgccagata cacacctcaa aggtactccc tcctcccggg aaggcaccct 60
ttcttcagtg ggtctcagtt c 81

<210> 121
<211> 19
<212> DNA
<213> Homo sapiens
<400> 121
cgtggtgccc ctctatgac 19

<210> 122
<211> 19
<212> DNA
<213> Homo sapiens
<400> 122
ggctagtggg cgcatgtag 19

<210> 123
<211> 68
<212> DNA
<213> Homo sapiens
<400> 123
cgtggtgccc ctctatgacc tgctgctgga gatgctggac gcccaccgcc tacatgcgcc 60
cactagcc 68

<210> 124
<211> 20
<212> DNA
<213> Homo sapiens
<400> 124
tggtccatcg ccagttatca 20

<210> 125
<211> 23
<212> DNA
<213> Homo sapiens
<400> 125
tgttctagcg atcttgcttc aca 23

<210> 126
<211> 76
<212> DNA
<213> Homo sapiens
<400> 126
tggtccatcg ccagttatca catctgtatg cggaacctca aaagagtccc tggtgtgaag 60
caagatcgct agaaca 76

<210> 127
<211> 24
<212> DNA
<213> Homo sapiens
<400> 127
catccatgac aactttggta tcgt 24

<210> 128
<211> 21
<212> DNA
<213> Homo sapiens
<400> 128
cagtcttctg ggtggcagtg a 21

<210> 129
<211> 74
<212> DNA
<213> Homo sapiens
<400> 129
catccatgac aactttggta tcgtggaagg actcatgacc acagtccatg ccatcactgc 60
cacccagaag actg 74

<210> 130
<211> 23
<212> DNA
<213> Homo sapiens
<400> 130
caaaggagct cactgtggtg tct 23

<210> 131
<211> 26
<212> DNA
<213> Homo sapiens
<400> 131
gagtcagaat ggcttattca cagatg 26

<210> 132
<211> 75
<212> DNA
<213> Homo sapiens
<400> 132
caaaggagct cactgtggtg tctgtgttcc aaccactgaa tctggacccc atctgtgaat 60
aagccattct gactc 75

<210> 133
<211> 20
<212> DNA
<213> Homo sapiens
<400> 133
ccatctgcat ccatcttgtt 20

<210> 134
<211> 20
<212> DNA
<213> Homo sapiens
<400> 134
ggccaccagg gtattatctg 20

<210> 135
<211> 67
<212> DNA
<213> Homo sapiens
<400> 135
ccatctgcat ccatcttgtt tgggctcccc acccttgaga agtgcctcag ataataccct 60
ggtggcc 67

<210> 136
<211> 23
<212> DNA
<213> Homo sapiens
<400> 136
cgaaaagatg ctgaacagtg aca 23

<210> 137
<211> 20
<212> DNA
<213> Homo sapiens
<400> 137
tcaggaacag ccaccagtga 20

<210> 138
<211> 73
<212> DNA
<213> Homo sapiens
<400> 138
cgaaaagatg ctgaacagtg acaaatccaa ctgaccagaa gggaggagga agctcactgg 60
tggctgttcc tga 73

<210> 139
<211> 20
<212> DNA
<213> Homo sapiens
<400> 139
gagaccctgc tgtcccagaa 20

<210> 140
<211> 23
<212> DNA
<213> Homo sapiens
<400> 140
ggttgtagtc agcgaaggag atc 23

<210> 141
<211> 76
<212> DNA
<213> Homo sapiens
<400> 141
gagaccctgc tgtcccagaa ccagggaggc aagaccttca ttgtgggaga ccagatctcc 60
ttcgctgact acaacc 76

<210> 142
<211> 20
<212> DNA
<213> Homo sapiens
<400> 142
cccactcagt agccaagtca 20

<210> 143
<211> 20
<212> DNA
<213> Homo sapiens
<400> 143
cacgcaggtg gtatcagtct 20

<210> 144
<211> 73
<212> DNA
<213> Homo sapiens
<400> 144
cccactcagt agccaagtca caatgtttgg aaaacagccc gtttacttga gcaagactga 60
taccacctgc gtg 73

<210> 145
<211> 24
<212> DNA
<213> Homo sapiens
<400> 145
catcaaatgt cagccctgga gttc 24

<210> 146
<211> 26
<212> DNA
<213> Homo sapiens
<400> 146
ttcctgtagg tctttacccc gatagc 26

<210> 147
<211> 85
<212> DNA
<213> Homo sapiens
<400> 147
catcaaatgt cagccctgga gttccatgat accacacgaa cacagctttt tgccttcgag 60
ctatcggggt aaagacctac aggaa 85

<210> 148
<211> 24
<212> DNA
<213> Homo sapiens
<400> 148
tccaggatgt taggaactgt gaag 24

<210> 149
<211> 22
<212> DNA
<213> Homo sapiens
<400> 149
gcgtgtctgc gtagtagctg tt 22

<210> 150
<211> 73
<212> DNA
<213> Homo sapiens
<400> 150
tccaggatgt taggaactgt gaagatggaa gggcatgaaa ccagcgactg gaacagctac 60
tacgcagaca cgc 73

<210> 151
<211> 23
<212> DNA
<213> Homo sapiens
<400> 151
aacgactgct actccaagct caa 23

<210> 152
<211> 22
<212> DNA
<213> Homo sapiens
<400> 152
ggatttccat cttgctcacc tt 22

<210> 153 
<211> 76 
<212> DNA 

<213> Homo sapiens 

<400> 153
aacgactgct actccaagct caaggagctg gtgcccagca tcccccagaa caagaaggtg 60 
agcaagatgg aaatcc 76 

<210> 154 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 154 

tccggagctg tgatctaagg a 21 

<210> 155 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 155 

cggacagagc gagctgactt 20 

<210> 156 
<211> 76 
<212> DNA 

<213> Homo sapiens
<400> 156 

tccggagctg tgatctaagg aggctggaga tgtattgcgc acccctcaag cctgccaagt 60 
cagctcgctc tgtccg 76 

<210> 157 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 157 

acgcaccggg tgtctga 17 

<210> 158 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 158 

tgccctttct tgatgatgat tatc 24 

<210> 159 

<211> 68 

<212> DNA 

<213> Homo sapiens 

<400> 159 

acgcaccggg tgtctgatcc caagttccac cccctccatt caaagataat catcatcaag 60 
aaagggca 68 

<210> 160 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 160 

ccattcaccc tgtgtaacag ga 22

<210> 161
<211> 21
<212> DNA
<213> Homo sapiens
<400> 161

ccgaccctct aggttaaggc a 21 

<210> 162 
<211> 68 
<212> DNA 

<213> Homo sapiens 
<400> 162 

ccattcaccc tgtgtaacag gaccccaagg acctgcctcc ccggaagtgc cttaacctag 60 
agggtcgg 68 

<210> 163 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 163 

cgtcaggacc caccatgtct 20 

<210> 164 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 164 

ggttaattgg tgacatcctc aaga 24 

<210> 165 
<211> 81 
<212> DNA 

<213> Homo sapiens 
<400> 165 

cgtcaggacc caccatgtct gccccatcac gcggccgaga catggcttgg ccacagctct 60 
tgaggatgtc accaattaac c 81 

<210> 166 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 166 

caaacgctga catgtacggt cta 23

<210> 167 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 167 

gctcgttggc gcactctt 18 

<210> 168 
<211> 88 
<212> DNA 

<213> Homo sapiens 
<400> 168 

caaacgctga catgtacggt ctatgccatt cctcccccgc atcacatcca ctggtattgg 60
cagttggagg aagagtgcgc caacgagc 88

<210> 169
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 169 

gaggcaactg cttatggctt aatta 25 

<210> 170

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 170 

ggcactcggc ttgagcat 18 

<210> 171 
<211> 75 
<212> DNA 

<213> Homo sapiens 
<400> 171 

gaggcaactg cttatggctt aattaagtca gatgcggcca tgactgtcgc tgtaaagatg 60 
ctcaagccga gtgcc 75 

<210> 172 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 172 

gtccccggga tggatgtt 18 

<210> 173 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 173 

gatcagtcaa gctgtctgac aattg 25 

<210> 174
<211> 79 
<212> DNA 

<213> Homo sapiens 
<400> 174 

gtccccggga tggatgtttt gccaagtcat tgttggataa gcgagatggt agtacaattg 60 
tcagacagct tgactgatc 79 

<210> 175 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 175 

cgaggattgg ttcttcagca a 21 

<210> 176 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 176 

actctgcacc agctcactgt tg 22 

<210> 177
<211> 73
<212> DNA
<213> Homo sapiens
<400> 177
cgaggattgg ttcttcagca agacagagga actgaaccgc gaggtggcca ccaacagtga 60
gctggtgcag agt 73

<210> 178
<211> 20
<212> DNA
<213> Homo sapiens
<400> 178
tcagtggaga aggagttgga 20

<210> 179
<211> 20
<212> DNA
<213> Homo sapiens
<400> 179
tgccatatcc agaggaaaca 20

<210> 180
<211> 69
<212> DNA
<213> Homo sapiens
<400> 180
tcagtggaga aggagttgga ccagtcaaca tctctgttgt cacaagcagt gtttcctctg 60
gatatggca 69

<210> 181
<211> 26
<212> DNA
<213> Homo sapiens
<400> 181
gtacaagaga gaaccagact ccaatg 26

<210> 182
<211> 18
<212> DNA
<213> Homo sapiens
<400> 182
gtgtagcccg cggacact 18

<210> 183
<211> 87
<212> DNA
<213> Homo sapiens
<400> 183
gtacaagaga gaaccagact ccaatgtcat tgtggtggac tggctgtcac gggctcagga 60
gcattaccca gtgtccgcgg gctacac 87

<210> 184
<211> 22
<212> DNA
<213> Homo sapiens
<400> 184
gacatttcca gtcctgcagt ca 22

<210> 185
<211> 20
<212> DNA
<213> Homo sapiens
<400> 185
ctccgatcgc acacatttgt 20

<210> 186 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 186 

gacatttcca gtcctgcagt caatgcctct ctgccccacc ctttgttcag tgtggctggt 60 
gccacgacaa atgtgtgcga tcggag 86 

<210> 187 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 187 

gttttggagg aaatgtgttc ttca 24 

<210> 188 
<211> 26 
<212> DNA 

<213> Homo sapiens 

<400> 188
ttctctaata cactgccgtc ttaagg 26 

<210> 189 
<211> 101 
<212> DNA 

<213> Homo sapiens 
<400> 189 

gttttggagg aaatgtgttc ttcagtgcac agaatgcagc aaaacagcca tctgataaat 60 
gctctgcaag ccctccctta agacggcagt gtattagaga a 101 

<210> 190 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 190 

acgagaacga gggcatctat gt 22 

<210> 191 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 191 

gcatgtaggt gcttccaatc ac 22 

<210> 192 
<211> 75 
<212> DNA 

<213> Homo sapiens 

<400> 192
acgagaacga gggcatctat gtgcaggatg tcaagaccgg aaaggtgcgc gctgtgattg 60 
gaagcaccta catgc 75 

<210> 193
<211> 21
<212> DNA
<213> Homo sapiens
<400> 193
tccctccact cggaaggact a 21

<210> 194
<211> 22
<212> DNA
<213> Homo sapiens
<400> 194
cggttgttgc tgatctgtct ca 22

<210> 195
<211> 84
<212> DNA
<213> Homo sapiens
<400> 195
tccctccact cggaaggact atcctgctgc caagagggtc aagttggaca gtgtcagagt 60
cctgagacag atcagcaaca accg 84

<210> 196
<211> 19
<212> DNA
<213> Homo sapiens
<400> 196
ttgttggtgt gccctggtg 19

<210> 197
<211> 21
<212> DNA
<213> Homo sapiens
<400> 197
tgggttctgt ccaaacactg g 21

<210> 198
<211> 67
<212> DNA
<213> Homo sapiens
<400> 198
ttgttggtgt gccctggtgc cgtggtggcg gtcactccct ctgctgccag tgtttggaca 60
gaaccca 67

<210> 199
<211> 20
<212> DNA
<213> Homo sapiens
<400> 199
actgaaggag acccttggag 20

<210> 200
<211> 20
<212> DNA
<213> Homo sapiens
<400> 200
taaataaccc tgcccacaca 20

<210> 201
<211> 62
<212> DNA
<213> Homo sapiens
<400> 201
actgaaggag acccttggag cctaggggca tcggcaggag agtgtgtggg cagggttatt 60
ta 62

<210> 202
<211> 28
<212> DNA
<213> Homo sapiens
<400> 202
agttactaaa aaataccacg aggtcctt 28

<210> 203
<211> 21
<212> DNA
<213> Homo sapiens
<400> 203
gtcggtgagt gatttgtgca a 21

<210> 204
<211> 79
<212> DNA
<213> Homo sapiens
<400> 204
agttactaaa aaataccacg aggtccttca gttgagacca aagaccggtg tcaggggatt 60
gcacaaatca ctcaccgac 79

<210> 205
<211> 20
<212> DNA
<213> Homo sapiens
<400> 205
gggagtttcc aagagatgga 20

<210> 206
<211> 20
<212> DNA
<213> Homo sapiens
<400> 206
cttcaaccac cttcccaaac 20

<210> 207
<211> 72
<212> DNA
<213> Homo sapiens
<400> 207
gggagtttcc aagagatgga ctagtgcttg gtcgggtctt ggggtctgga gcgtttggga 60
aggtggttga ag 72

<210> 208
<211> 23
<212> DNA
<213> Homo sapiens
<400> 208
aggtgtcatc catcaacgtc tct 23

<210> 209
<211> 20
<212> DNA
<213> Homo sapiens
<400> 209
tcccgatcac aatgcacatg 20

<210> 210
<211> 90
<212> DNA
<213> Homo sapiens
<400> 210
aggtgtcatc catcaacgtc tctgtgaacg cagtgcag^c tgtggtccgc cagggtgaga 60
acatcaccct catgtgcatt gtgatcggga 90

<210> 211
<211> 24
<212> DNA
<213> Homo sapiens
<400> 211
agagccagtt gctgtagaac tcaa 24

<210> 212
<211> 21
<212> DNA
<213> Homo sapiens
<400> 212
ctgggcctac acagtccttc a 21

<210> 213
<211> 74
<212> DNA
<213> Homo sapiens
<400> 213
agagccagtt gctgtagaac tcaaatctct gctgggcaag gatgttctgt tcttgaagga 60
ctgtgtaggc ccag 74

<210> 214
<211> 26
<212> DNA
<213> Homo sapiens
<400> 214
gaaatgactg catcgttgat aaaatc 26

<210> 215
<211> 19
<212> DNA
<213> Homo sapiens
<400> 215
tgccagcctg acagcactt 19

<210> 216
<211> 78
<212> DNA
<213> Homo sapiens
<400> 216
gaaatgactg catcgttgat aaaatccgca gaaaaaactg cccagcatgt cgccttagaa 60
agtgctgtca ggctggca 78

<210> 217
<211> 20
<212> DNA
<213> Homo sapiens
<400> 217
gatcaacggc tacatccaga 20

<210> 218
<211> 20
<212> DNA
<213> Homo sapiens
<400> 218
tgaactgtga ggccagagac 20

<210> 219
<211> 68
<212> DNA
<213> Homo sapiens
<400> 219
gatcaacggc tacatccaga agatcaagtc gggagaggag gactttgagt ctcnggcctc 60
acagttca 68
<210> 220 

<211> 19 

<212> DNA . 

<213> Homo sapiens 



19 



20 



<400> 220 

gtggatgtgc cctgaagga 

<210> 221 
<211> 20 ; 
<212> DNA , 
<213> Homo sapiens 

<400> 221 

ctgcggatcc agggtaagaa 

<210> 222 
<211> 70 
<212> DNA 

<213> Homo sapiens 

gSgatg?gc cctgaaggac aagccaggcg tctacacgag agtctcacac ttcttaccct 60 
ggatccgcag 

<210> 223 

<211> 27 

<212> DNA 

<213> Homo sapiens 



27 



22 



<400> 223 

tggacttcta gtgatgagaa agattga 

<210> 224 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 224 

cactgcgaga tcaccacagg ta 

<210> 225 

<211> 84 

<212> DNA 

<213> Homo sapiens 

tgg2ct?cta gtgatgagaa agattgagaa tgttcccaca ggccccaaca ataagcccaa 60 
gctacctgtg gtgatctcgc agtg 



<210> 226 




<211> 25
<212> DNA
<213> Homo sapiens
<400> 226
tggctaagtg aagatgacaa tcatg 25

<210> 227
<211> 25
<212> DNA
<213> Homo sapiens
<400> 227
tgcacatatc attacaccag ttcgt 25

<210> 228
<211> 81
<212> DNA
<213> Homo sapiens
<400> 228
tggctaagtg aagatgacaa tcatgttgca gcaattcact gtaaagctgg aaagggacga 60
actggtgtaa tgatatgtgc a 81

<210> 229
<211> 23
<212> DNA
<213> Homo sapiens
<400> 229
tctgcagagt tggaagcact cta 23

<210> 230
<211> 21
<212> DNA
<213> Homo sapiens
<400> 230
gccgaggctt ttctaccaga a 21

<210> 231
<211> 79
<212> DNA
<213> Homo sapiens
<400> 231
tctgcagagt tggaagcact ctatggtgac atcgatgctg tggagctgta tcctgccctt 60
ctggtagaaa agcctcggc 79

<210> 232
<211> 24
<212> DNA
<213> Homo sapiens
<400> 232
acgacacgta tgccgtacag tact 24

<210> 233
<211> 18
<212> DNA
<213> Homo sapiens
<400> 233
ccgggaaaac acgaagga 18

<210> 234
<211> 86
<212> DNA
<213> Homo sapiens
<400> 234
acgacacgta tgccgtacag tactcctgcc gcctcctgaa cctcgatggc acctgtgctg 60
acagctactc cttcgtgttt tcccgg 86

<210> 235
<211> 19
<212> DNA
<213> Homo sapiens
<400> 235
ctgccgggat ggcttctat 19

<210> 236
<211> 22
<212> DNA
<213> Homo sapiens
<400> 236
ccaggttctg gaaactgtgg at 22

<210> 237 
<211> 68 
<212> DNA 

<213> Homo sapiens 
<400> 237 

ctgccgggat ggcttctatg aggctgagct ctgcccggac cgctgcatcc acagtttcca 60 
gaacctgg 68 

<210> 238 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 238 

ccacaagctg aaggcagaca 20 

<210> 239 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 239 

gcgtgcttcc ttggtcttag a 21 

<210> 240 
<211> 85 
<212> DNA 

<213> Homo sapiens 
<400> 240 

ccacaagctg aaggcagaca aggcccgcaa gaagctcctg gctgaccagg ctgaggcccg 60
caggtctaag accaaggaag cacgc 85 

<210> 241 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 241 

ccattctatc atcaacgggt acaa 24 

<210> 242 
<211> 23 
<212> DNA 

<213> Homo sapiens 
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<400> 242 - - ?3 

tcagcaagtg ggaaggtgta ate 

<210> 243 
<211> 75 

<212> DNA . . . - 
<213> Homo sapiens 

c«?tcta?c ateaaegggt acaaacgagt cctggccttg tctgtggaga eggattacac 60 
cttcccactt gctga 

<210> 244 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 244 20 
tatcgaggca ggtcatacca 

<210> 245 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 245 20 
taacgcttgg catcatcatt 

<210> 246 
<211> 74 
<212> DNA 

<213> Homo sapiens 

ta?2gaggca ggtcatacca tgaccggaag tcaaaagttg acctggatag gctcaatgat 60 
gatgecaage gtta 

<210> 247 
<211> 19 
<212> DNA 

<213> Homo sapiens 

<400> 247 
ccgcaacgtg gttttctca 19 

<210> 248 
<211> 21 
<212> DNA 

<213> Homo sapiens 

<400> 248 21 
tgctgggttt ctcctcctgt t 

<210> 249 
<211> 81 
<212> DNA 

<213> Homo sapiens 

<400> 249 
ccgcaacgtg gttttctcac cctatggggt ggcctcggtg ttggccatgc tccagctgac 60 
aacaggagga gaaacccagc a 81 

<210> 250 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 250 
tcaagaccat catcactttc attgt 25 

<210> 251 
<211> 27 
<212> DNA 

<213> Homo sapiens 

<400> 251 
ggatcaggaa gtacacggag tataact 27 

<210> 252 
<211> 96 
<212> DNA 

<213> Homo sapiens 

<400> 252 

tcaagaccat catcactttc attgtctcgg acgtgcgggg cctgggcctc ccggtccgca 60 
agcagttcca gttatactcc gtgtacttcc tgatcc 96 

<210> 253 
<211> 19 
<212> DNA 

<213> Homo sapiens 

<400> 253 
gcccgaaacg ccgaatata 19 

<210> 254 
<211> 23 
<212> DNA 

<213> Homo sapiens 

<400> 254 
cgtggctctc ttatcctcat gat 23 

<210> 255 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 255 

gcccgaaacg ccgaatataa tcccaagcgg tttgctgcgg taatcatgag gataagagag 60 
ccacg 65 

<210> 256 
<211> 19 
<212> DNA 

<213> Homo sapiens 

<400> 256 
gccctcccag tgtgcaaat 19 

<210> 257 
<211> 25 
<212> DNA 

<213> Homo sapiens 
<400> 257 

cgtcgatggt attaggatag aagca 25 

<210> 258 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 258 

gccctcccag tgtgcaaata agggctgctg tttcgacgac accgttcgtg gggtcccctg 60 
gtgcttctat cctaatacca tcgacg 86 

<210> 259 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 259 

caagctagat cagcattctc taacttg 27 

<210> 260 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 260 

cacatgactg ttatcgccat ctact 25 

<210> 261 

<211> 99 

<212> DNA 

<213> Homo sapiens 

<400> 261 
caagctagat cagcattctc taacttgttt ggtggagaac cattgtcata tacccggttc 60 
agcctggctc ggcaagtaga tggcgataac agtcatgtg 99 

<210> 262 
<211> 22 
<212> DNA 

<213> Homo sapiens 

<400> 262 22 
cacaggaaca acagcatctt tc 

<210> 263 
<211> 20 
<212> DNA 

<213> Homo sapiens 

<400> 263 20 
agataagccc ctgggatcca 

<210> 264 
<211> 75 
<212> DNA 

<213> Homo sapiens 

<400> 264 
cacaggaaca acagcatctt tcaccaagat gggtggcacc aaccttgctg ggacttggat 60 
cccaggggct tatct 75 

<210> 265 
<211> 21 
<212> DNA 

<213> Homo sapiens 

<400> 265 21 
ggattgctca acaaccatgc t 

<210> 266 
<211> 24 
<212> DNA 

<213> Homo sapiens 

<400> 266 24 
ggcattaaca cttttggacg ataa 
<210> 267 

<211> 91 

<212> DNA 

<213> Homo sapiens 

<400> 267 
ggattgctca acaaccatgc tgggcatctg gaccctccta cctctggttc ttacgtctgt 60 
tgctagatta tcgtccaaaa gtgttaatgc c 91 

<210> 268 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 268 

gcactttggg attctttcca ttat 24 

<210> 269 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 269 
gcatgtaaga agaccctcac tgaa 24 

<210> 270 

<211> 80 

<212> DNA 

<213> Homo sapiens 

<400> 270 
gcactttggg attctttcca ttatgattct ttgttacagg caccgagaat gttgtattca 60 
gtgagggtct tcttacatgc 80 

<210> 271 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 271 20 
aatccaaggg ggagagtgat 

<210> 272 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 272 
gtacagattt tgcccgagga 20 

<210> 273 

<211> 72 

<212> DNA 

<213> Homo sapiens 

<400> 273 
aatccaaggg ggagagtgat gacttccata tggactttga ctcagctgtg gctcctcggg 60 
caaaatctgt ac 72 

<210> 274 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 274 
tgtggacatc ttcccctcag a 21 

<210> 275 
<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 275 
ctagcccgac cggttcgt 18 

<210> 276 

<211> 66 

<212> DNA 

<213> Homo sapiens 

<400> 276 
tgtggacatc ttcccctcag acttccctac tgagccacct tctctgccac gaaccggtcg 60 
ggctag 66 

<210> 277 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 277 20 
ctttgaaccc ttgcttgcaa 

<210> 278 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 278 
cccgggacaa agcaaatg 18 

<210> 279 

<211> 68 

<212> DNA 

<213> Homo sapiens 

<400> 279 
ctttgaaccc ttgcttgcaa taggtgtgcg tcagaagcac ccaggacttc catttgcttt 60 
gtcccggg 68 

<210> 280 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 280 18 
gcctcggtgt gcctttca 

<210> 281 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 281 19 
cgtgatgtgc gcaatcatg 

<210> 282 

<211> 65 

<212> DNA 

<213> Homo sapiens 

<400> 282 
gcctcggtgt gcctttcaac atcgccagct acgccctgct cacgtacatg attgcgcaca 60 
tcacg 65 

<210> 283 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 283 

ctgctgtctt gggtgcattg 

<210> 284 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 284 

gcagcctggg accacttg 

<210> 285 

<211> 71 

<212> DNA 

<213> Homo sapiens 

<400> 285 
ctgctgtctt gggtgcattg gagccttgcc ttgctgctct acctccacca tgccaagtgg 60 
tcccaggctg c 71 

<210> 286 
<211> 1947 
<212> DNA 

<213> Homo sapiens 



<400> 286 

ttttccccag atatggggtt ctattcagcc atagataatc tagacagagg atttcagaat 60 
gaaaggaaaa atgtgtggag attagtccta gttcattctg agggccgact aagtggctca 120 
gccagcttct tactccatct gcagttcata ctgccaaaga gctcccactt ccaaatcccc 180 
agtgacttta tggagaagat tctgcattaa attgtctttc gaatgatggg gaagcaaggc 240 
ataatatgcg atgatgagga gaaagtagac cagtgaggtg attgcaagac taacaaggag 300 
actcaatggg aagtttttct ttcttttaga tattgctttt gaagtagatg gtaaaatttt 360 
tgtcatcctt cttgtatttt ttgtacccca agttacaatt tttcttcttc cttgtaaata 420 
atttaaacag tatttatttt tgtaaggcat aactagaaac taaaatatat tctaaaaaat 480 
tcattattct gaacaaagtg atcaaattag aatacatatt tttcaacagt ggtagagctt 540 
ttaatatatg tttattgaaa gttatctata atacttgcac cagtgttgaa aaaagttaac 600 
atgtaggcaa gagcaatatg tttgtctcaa ggatttttcc atggtttcct cagtgatggt 660 
gtcctggaat tattcaggtg gtgaccatca ctggtctaag tttgtgtgca gggttttcag 720 
acgtgttttt gtgaaacttg gtagaaccat ggctaataaa gaggacagtg ttgtcagggt 780 
ccatctgccc tccatagaaa aatgtctctg gctcataaaa tgagactccc tcagggacta 840 
aatatgaact gacagcagta actctgatac agaataatct aaattgcatc aaatggcctt 900 
aattcagagt ttgttaggct tatcagtatg ttgcttttaa ttggggtggg aaagtagagg 960 
gagagaaagc aagacattta ttaagcacct cgtatgtgcc aggcactatg ctaagcactt 1020 
tacataagtt aggattaatc cctgcaagaa tcctataaag aatgttacta gcatttacac 1080 
ttcccaaatg aaggtaccaa agctcaaacg caatgttgtg aagctgtttc cttcagattt 1140 
aggttatgtg ggatgatgtg ggattgaaga ggaaagaaag gtgggattat ccccctagga 1200 
agactttcag gcctgacttc ataggaattc atccatctta tcatgtggag tttatctcac 1260 
cctgctgttg caggatgcta tttgcatgtg tccccaggtg atgttttttc tttggggagt 1320 
aggggtttgg cttcctcatt catccctctt gctaaaagag gagatagttg atgttgcatc 1380 
taaagatgct ataagacaat gaaagtttga tgttgtacat acctacaagt accatttttg 1440 
tgcatgatta cactccactg acatcttcca agtactgcat gtgattgaat aagaaacaag 1500 
aaagtgacca caccaaagcc tccctggctg gtgtacaggg atcaggtcca cagtggtaca 1560 
gattcaacca ccacccaggg agtgcttgca gactctgcat agatgttgct gcatgcgtcc 1620 
catgtgcctg tcagaatggc agtgtttaat tctcttgaaa gaaagttatt tgctcactat 1680 
ccccagcctc aaggagccaa ggaagagtca ttcacatgga aggtccgggt ctggtcagcc 1740 
actctgactt ttctaccaca ttaaattctc cattacatct cactattggt aatggcttaa 1800 
gtgtaaagag ccatgatgtg tatattaagc tatgtgccac atatttattt ttagactctc 1860 
cacagcattc atgtcaatat gggattaatg cctaaacttt gtaaatattg tacagtttgt 1920 
aaatcaatga ataaaggttt tgagtgt 1947 



<210> 287 

<211> 1311 

<212> DNA 

<213> Homo sapiens 
<400> 287 
tagtcgggcg gggttgtgag acgccgcgct cagcttccat cgctgggcgg tcaacaagtg 60 
cgggcctggc tcagcgcggg ggggcgcgga gaccgcgagg cgaccgggag cggctgggtt 120 
cccggctgcg cgcccttcgg ccaggccggg agccgcgcca gtcggagccc ccggcccagc 180 
gtggtccgcc tccctctcgg cgtccacctg cccggagtac tgccagcggg catgaccgac 240 
ccaccagggg cgccgccgcc ggcgctcgca ggccgcggat gaagaagaaa acccggcgcc 300 
gctcgacccg gagcgaggag ttgacccgga gcgaggagtt gaccctgagt gaggaagcga 360 
cctggagtga agaggcgacc cagagtgagg aggcgaccca gggcgaagag atgaatcgga 420 
gccaggaggt gacccgggac gaggagncga cccggagcga ggaggtgacc agggaggaaa 480 
tggcggcagc tgggctcacc gtgactgtca cccacagcaa tgagaagcac gaccttcatg 540 
ttacctccca gcagggcagc agtgaaccag ttgtccaaga cctggcccag gttgttgaag 600 
aggtcatagg ggttccacag tcttttcaga aactcatatt taagggaaaa tctctgaagg 660 
aaatggaaac accgttgtca gcacttggaa tacaagatgg ttgccgggtc atgttaattg 720 
ggaaaaagaa cagtccacag gaagaggttg aactaaagaa gttgaaacat ttggagaagt 780 
ctgtggagaa gatagctgac cagctggaag agttgaataa agagcttact ggaatccagc 840 
agggttttct gcccaaggat ttgcaagctg aagctctctg caaacttgat aggagagtaa 900 
aagccacaat agagcagttt atgaagatct tggaggagat tgacacactg atcctgccag 960 
aaaatttcaa agacagtaga ttgaaaagga aaggcttggt aaaaaaggtt caggcattcc 1020 
tagccgagtg tgacacagtg gagcagaaca tctgccagga gactgagcgg ctgcagtcta 1080 
caaactttgc cctggccgag tgaggtgtag cagaaaaagg ctgtgctgcc ctgaagaatg 1140 
gcgccaccag ctctgccgtc tctggatcgg aatttacctg atttcttcag ggctgctggg 1200 
ggcaactggc catttgccaa ttttcctact ctcacactgg ttctcaatga aaaatagtgt 1260 
ctttgtgatt tgagtaaagc tcctattctg tttttcacaa aaaaaaaaaa a 1311 



<210> 288 
<211> 582 
<212> DNA 

<213> Homo sapiens 



<400> 288 

atggcccgcg cacgccagga gggcagctcc ccggagcccg tagagggcct ggcccgcgac 60 
ggcccgcgcc ccttcccgct cggccgcctg gtgccctcgg cagtgtcctg cggcctctgc 120 
gagcccggcc tggctgccgc ccccgccgcc cccaccctgc tgcccgctgc ctacctctgc 180 
gcccccaccg ccccacccgc cgtcaccgcc gccctggggg gttcccgctg gcctgggggt 240 
ccccgcagcc ggccccgagg cccgcgcccg gacggtcctc agccctcgct ctcgctggcg 300 
gagcagcacc tggagtcgcc cgtgcccagc gccccggggg ctctggcggg cggtcccacc 360 
caggcggccc cgggagnccg cggggaggag gaacagtggg cccgggagat cggggcccag 420 
ctgcggcgga tggcggacga cctcaacgca cagtacgagc ggcggagaca agaggagcag 480 
cagcggcacc gcccctcacc ctggagggtc ctgtacaatc tcatcatggg actcctgccc 540 
ttacccaggg gccacagagc ccccgagatg gagcccaatt ag 582 



<210> 289 
<211> 6030 
<212> DNA 

<213> Homo sapiens 



<400> 289 

gttggccccc gttacttttc ctctgggaaa tatggcgcac gctgggagaa cagggtacga 60 
taaccgggag atagtgatga agtacatcca ttataagctg tcgcagaggg gctacgagtg 120 
ggatgcggga gatgtgggcg ccgcgccccc gggggccgcc cccgcgccgg gcatcttctc 180 
ctcgcagccc gggcacacgc cccatacagc cgcatcccgg gacccggtcg ccaggacctc 240 
gccgctgcag accccggctg cccccggcgc cgccgcgggg cctgcgctca gcccggtgcc 300 
acctgtggtc cacctgaccc tccgccaggc cggcgacgac ttctcccgcc gctaccgccg 360 
cgacttcgcc gagatgtcca ggcagctgca cctgacgccc ttcaccgcgc ggggacgctt 420 
tgccacggtg gtggaggagc tcttcaggga cggggtgaac tgggggagga ttgtggcctt 480 
ctttgagttc ggtggggtca tgtgtgtgga gagcgtcaac cgggagatgt cgcccctggt 540 
ggacaacatc gccctgtgga tgactgagta cctgaaccgg cacctgcaca cctggatcca 600 
ggataacgga ggctgggatg cctttgtgga actgtacggc cccagcatgc ggcctctgtt 660 
tgatttctcc tggctgnctc tgaagactct gctcagtttg gccctggtgg gagcttgcat 720 
caccctgggt gcctatctgg gccacaagtg aagtcaacat gcctgcccca aacaaatatg 780 
caaaaggttc actaaagcag tagaaataat atgcattgtc agtgatgttc catgaaacaa 840 
agctgcaggc tgtttaagaa aaaataacac acatataaac atcacacaca cagacagaca 900 
cacacacaca caacaattaa cagtcttcag gcaaaacgtc gaatcagcta tttactgcca 960 
aagggaaata tcatttattt tttacattat taagaaaaaa agatttattt atttaagaca 1020 
gtcccatcaa aactcctgtc tttggaaatc cgaccactaa ttgccaagca ccgcttcgtg 1080 
tggctccacc tggatgttct gtgcctgtaa acatagattc gctttccatg ttgttggccg 1140 
gatcaccatc tgaagagcag acggatggaa aaaggacctg atcattgggg aagctggctt 1200 
tctggctgct ggaggctggg gagaaggtgt tcattcactt gcatttcttt gccctggggg 1260 
ctgtgatatt aacagaggga gggttcctgt ggggggaagt ccatgcctcc ctggcctgaa 1320 
gaagagactc tttgcatatg actcacatga tgcatacctg gtgggaggaa aagagttggg 1380 
aacttcagat ggacctagta cccactgaga tttccacgcc gaaggacagc gatgggaaaa 1440 
atgcccttaa atcataggaa agtatttttt taagctacca attgtgccga gaaaagcatt 1500 
ttagcaattt atacaatatc atccagtacc ttaagccctg attgtgtata ttcatatatt 1560 
ttggatacgc accccccaac tcccaatact ggctctgtct gagtaagaaa cagaatcctc 1620 
tggaacttga ggaagtgaac atttcggtga cttccgcatc aggaaggcta gagttaccca 1680 
gagcatcagg ccgccacaag tgcctgcttt taggagaccg aagtccgcag aacctgcctg 1740 
tgtcccagct tggaggcctg gtcctggaac tgagccgggg ccctcactgg cctcctccag 1800 
ggatgatcaa cagggcagtg tggtctccga atgtctggaa gctgatggag ctcagaattc 1860 
cactgtcaag aaagagcagt agaggggtgt ggctgggcct gtcaccctgg ggccctccag 1920 
gtaggcccgt tttcacgtgg agcatgggag ccacgaccct tcttaagaca tgtatcactg 1980 
tagagggaag gaacagaggc cctgggccct tcctatcaga aggacatggt gaaggctggg 2040 
aacgtgagga gaggcaatgg ccacggccca ttttggctgt agcacatggc acgttggctg 2100 
tgtggccttg gcccacctgt gagtttaaag caaggcttta aatgactttg gagagggtca 2160 
caaatcctaa aagaagcatt gaagtgaggt gtcatggatt aattgacccc tgtctatgga 2220 
attacatgta aaacattatc ttgtcactgt agtttggttt tatttgaaaa cctgacaaaa 2280 
aaaaagttcc aggtgtggaa tatgggggtt atctgtacat cctggggcat taaaaaaaaa 2340 
atcaatggtg gggaactata aagaagtaac aaaagaagtg acatcttcag caaataaact 2400 
aggaaatttt tttttcttcc agtttagaat cagccttgaa acattgatgg aataactctg 2460 
tggcattatt gcattatata ccatttatct gtattaactt tggaatgtac tctgttcaat 2520 
gtttaatgct gtggttgata tttcgaaagc tgctttaaaa aaatacatgc atctcagcgt 2580 
ttttttgttt ttaattgtat ttagttatgg cctatacact atttgtgagc aaaggtgatc 2640 
gttttctgtt tgagattttt atctcttgat tcttcaaaag cattctgaga aggtgagata 2700 
agccctgagt ctcagctacc taagaaaaac ctggatgtca ctggccactg aggagctttg 2760 
tttcaaccaa gtcatgtgca tttccacgtc aacagaattg tttattgtga cagttatatc 2820 
tgttgtccct ttgaccttgt ttcttgaagg tttcctcgtc cctgggcaat tccgcattta 2880 
attcatggta ttcaggatta catgcatgtt tggttaaacc catgagattc attcagttaa 2940 
aaatccagat ggcaaatgac cagcagattc aaatctatgg tggtttgacc tttagagagt 3000 
tgctttacgt ggcctgtttc aacacagacc cacccagagc cctcctgccc tccttccgcg 3060 
ggggctttct catggctgtc cttcagggtc ttcctgaaat gcagtggtgc ttacgctcca 3120 
ccaagaaagc aggaaacctg tggtatgaag ccagacctcc ccggcgggcc tcagggaaca 3180 
gaatgatcag acctttgaat gattctaatt tttaagcaaa atattatttt atgaaaggtt 3240 
tacattgtca aagtgatgaa tatggaatat ccaatcctgt gctgctatcc tgccaaaatc 3300 
attttaatgg agtcagtttg cagtatgctc cacgtggtaa gatcctccaa gctgctttag 3360 
aagtaacaat gaagaacgtg gacgctttta atataaagcc tgttttgtct tctgttgttg 3420 
ttcaaacggg attcacagag tatttgaaaa atgtatatat attaagaggt cacgggggct 3480 
aattgctggc tggctgcctt ttgctgtggg gttttgttac ctggttttaa taacagtaaa 3540 
tgtgcccagc ctcttggccc cagaactgta cagtattgtg gctgcacttg ctctaagagt 3600 
agttgatgtt gcattttcct tattgttaaa aacatgttag aagcaatgaa tgtatataaa 3660 
agcctcaact agtcattttt ttctcctctt cttttttttc attatatcta attattttgc 3720 
agttgggcaa cagagaacca tccctatttt gtattgaaga gggattcaca tctgcatctt 3780 
aactgctctt tatgaatgaa aaaacagtcc tctgtatgta ctcctcttta cactggccag 3840 
ggtcagagtt aaatagagta tatgcacttt ccaaattggg gacaagggct ctaaaaaaag 3900 
ccccaaaagg agaagaacat ctgagaacct cctcggccct cccagtccct cgctgcacaa 3960 
atactccgca agagaggcca gaatgacagc tgacagggtc tatggccatc gggtcgtctc 4020 
cgaagatttg gcaggggcag aaaactctgg caggcttaag atttggaata aagtcacaga 4080 
atcaaggaag cacctcaatt tagttcaaac aagacgccaa cattctctcc acagctcact 4140 
tacctctctg tgttcagatg tggccttcca tttatatgtg atctttgttt tattagtaaa 4200 
tgcttatcat ctaaagatgt agctctggcc cagtgggaaa aattaggaag tgattataaa 4260 
tcgagaggag ttataataat caagattaaa tgtaaataat cagggcaatc ccaacacatg 4320 
tctagctttc acctccagga tctattgagt gaacagaatt gcaaatagtc tctatttgta 4380 
attgaactta tcctaaaaca aatagtttat aaatgtgaac ttaaactcta attaattcca 4440 
actgtacttt taaggcagtg gctgttttta gactttctta tcacttatag ttagtaatgt 4500 
acacctactc tatcagagaa aaacaggaaa ggctcgaaat acaagccatt ctaaggaaat 4560 
tagggagtca gttgaaattc tattctgatc ttattctgtg gtgtcttttg cagcccagac 4620 
aaatgtggtt acacactttt taagaaatac aattctacat tgtcaagctt atgaaggttc 4680 
caatcagatc tttattgtta ttcaatttgg atctttcagg gatttttttt ttaaattatt 4740 
atgggacaaa ggacatttgt tggaggggtg ggagggagga acaattttta aatataaaac 4800 
attcccaagt ttggatcagg gagttggaag ttttcagaat aaccagaact aagggtatga 4860 
aggacctgta ttggggtcga tgtgatgcct ctgcgaagaa ccttgtgtga caaatgagaa 4920 
acattttgaa gtttgtggta cgacctttag attccagaga catcagcatg gctcaaagtg 4980 
cagctccgtt tggcagtgca atggtataaa tttcaagctg gatatgtcta atgggtattt 5040 
aaacaataaa tgtgcagttt taactaacag gatatttaat gacaaccttc tggttggtag 5100 
ggacatctgt ttctaaatgt ttattatgta caatacagaa aaaaatttta taaaattaag 5160 
caatgtgaaa ctgaattgga gagtgataat acaagtcctt tagtcttacc cagtgaatca 5220 
ttctgttcca tgtctttgga caaccatgac cttggacaat catgaaatat gcatctcact 5280 
ggatgcaaag aaaatcagat ggagcatgaa tggtactgta ccggttcatc tggactgccc 5340 
cagaaaaata acttcaagca aacatcctat caacaacaag gttgttctgc ataccaagct 5400 
gagcacagaa gatgggaaca ctagtggagg atggaaaggc tcgctcaatc aagaaaattc 5460 
tgagactatt aataaataag actgtagtgt agatactgag taaatccatg cacctaaacc 5520 
ttttggaaaa tctgccgtgg gccctccaga tagctcattt cattaagttt ttccctccaa 5580 
ggtagaattt gcaagagtga cagtggattg catttctttt ggggaagctt tcttttggtg 5640 
gttttgttta ttataccttc ttaagttttc aaccaaggtt tgcttttgtt ttgagttact 5700 
ggggttattt ttgttttaaa taaaaataag tgtacaataa gtgtttttgt attgaaagct 5760 
tttgttatca agattttcat acttttacct tccatggctc tttttaagat tgatactttt 5820 
aagaggtggc tgatattctg caacactgta cacataaaaa atacggtaag gatactttac 5880 
atggttaagg taaagtaagt ctccagttgg ccaccattag ctataatggc actttgtttg 5940 
tgttgttgga aaaagtcaca ttgccattaa actttccttg tctgtctagt taatattgtg 6000 
aagaaaaata aagtacagtg tgagatactg 6030 



<210> 290 
<211> 10987 
<212> DNA 

<213> Homo sapiens 



<400> 290 
ggtggcgcga gcttctgaaa ctaggcggca gaggcggagc cgctgtggca ctgctgcgcc 60 
tctgctgcgc ctcgggtgtc ttttgcggcg gtgggtcgcc gccgggagaa gcgtgagggg 120 
acagatttgt gaccggcgcg gtttttgtca gcttactccg gccaaaaaag aactgcacct 180 
ctggagcgga cttatttacc aagcattgga ggaatatcgt aggtaaaaat gcctattgga 240 
tccaaagaga ggccaacatt ttttgaaatt tttaagacac gctgcaacaa agcagattta 300 
ggaccaataa gtcttaattg gtttgaagaa ctttcttcag aagctccacc ctataattct 360 
gaacctgcag aagaatctga acataaaaac aacaattacg aaccaaacct atttaaaact 420 
ccacaaagga aaccatctta taatcagctg gcttcaactc caataatatt caaagagcaa 480 
gggctgactc tgccgctgta ccaatctcct gtaaaagaat tagataaatt caaattagac 540 
ttaggaagga atgttcccaa tagtagacat aaaagtcttc gcacagtgaa aactaaaatg 600 
gatcaagcag atgatgtttc ctgtccactt ctaaattctt gtcttagtga aagtcctgtt 660 
gttctacaat gtacacatgt aacaccacaa agagataagt cagtggtatg tgggagtttg 720 
tttcatacac caaagtttgt gaagggtcgt cagacaccaa aacatatttc tgaaagtcta 780 
ggagctgagg tggatcctga tatgtcttgg tcaagttctt tagctacacc acccaccctt 840 
agttctactg tgctcatagt cagaaatgaa gaagcatctg aaactgtatt tcctcatgat 900 
actactgcta atgtgaaaag ctatttttcc aatcatgatg aaagtctgaa gaaaaatgat 960 
agatttatcg cttctgtgac agacagtgaa aacacaaatc aaagagaagc tgcaagtcat 1020 
ggatttggaa aaacatcagg gaattcattt aaagtaaata gctgcaaaga ccacattgga 1080 
aagtcaatgc caaatgtcct agaagatgaa gtatatgaaa cagttgtaga tacctctgaa 1140 
gaagatagtt tttcattatg tttttctaaa tgtagaacaa aaaatctaca aaaagtaaga 1200 
actagcaaga ctaggaaaaa aattttccat gaagcaaacg ctgatgaatg tgaaaaatct 1260 
aaaaaccaag tgaaagaaaa atactcattt gtatctgaag tggaaccaaa tgatactgat 1320 
ccattagatt caaatgtagc acatcagaag ccctttgaga gtggaagtga caaaatctcc 1380 
aaggaagttg taccgtcttt ggcctgtgaa tggtctcaac taaccctttc aggtctaaat 1440 
ggagcccaga tggagaaaat acccctattg catatttctt catgtgacca aaatatttca 1500 
gaaaaagacc tattagacac agagaacaaa agaaagaaag attttcttac ttcagagaat 1560 
tctttgccac gtatttctag cctaccaaaa tcagagaagc cattaaatga ggaaacagtg 1620 
gtaaataaga gagatgaaga gcagcatctt gaatctcata cagactgcat tcttgcagta 1680 
aagcaggcaa tatctggaac ttctccagtg gcttcttcat ttcagggtat caaaaagtct 1740 
atattcagaa taagagaatc acctaaagag actttcaatg caagtttttc aggtcatatg 1800 
actgatccaa actttaaaaa agaaactgaa gcctctgaaa gtggactgga aatacatact 1860 
gtttgctcac agaaggagga ctccttatgt ccaaatttaa ttgataatgg aagctggcca 1920 
gccaccacca cacagaattc tgtagctttg aagaatgcag gtttaatatc cactttgaaa 1980 
aagaaaacaa ataagtttat ttatgctata catgatgaaa cattttataa aggaaaaaaa 2040 
ataccgaaag accaaaaatc agaactaatt aactgttcag cccagtttga agcaaatgct 2100 
tttgaagcac cacttacatt tgcaaatgct gattcaggtt tattgcattc ttctgtgaaa 2160 
agaagctgtt cacagaatga ttctgaagaa ccaactttgt ccttaactag ctcttttggg 2220 
acaattctga ggaaatgttc tagaaatgaa acatgttcta ataatacagt aatctctcag 2280 
gatcttgatt ataaagaagc aaaatgtaat aaggaaaaac tacagttatt tattacccca 2340 
gaagctgatt ctctgtcatg cctgcaggaa ggacagtgtg aaaatgatcc aaaaagcaaa 2400 
aaagtttcag atataaaaga agaggtcttg gctgcagcat gtcacccagt acaacattca 2460 
aaagtggaat acagtgatac tgactttcaa tcccagaaaa gtcttttata tgatcatgaa 2520 
aatgccagca ctcttatttt aactcctact tccaaggatg ttctgtcaaa cctagtcatg 2580 
atttctagag gcaaagaatc atacaaaatg tcagacaagc tcaaaggtaa caattatgaa 2640 
tctgatgttg aattaaccaa aaatattccc atggaaaaga atcaagatgt atgtgcttta 2700 
aatgaaaatt ataaaaacgt tgagctgttg ccacctgaaa aatacatgag agtagcatca 2760 
ccttcaagaa aggtacaatt caaccaaaac acaaatctaa gagtaatcca aaaaaatcaa 2820 

gaagaaacta cttcaatttc aaaaataact gtcaatccag actctgaaga acttttctca 2880 
gacaatgaga ataattttgt cttccaagta gctaatgaaa ggaataatct tgctttagga 2940 
aatactaagg aacttcatga aacagacttg acttgtgtaa acgaacccat tttcaagaac 3000 
tctaccatgg ttttatatgg agacacaggt gataaacaag caacccaagt gtcaattaaa 3060 
aaagatttgg tttatgttct tgcagaggag aacaaaaata gtgtaaagca gcatataaaa 3120 
atgactctag gtcaagattt aaaatcggac atctccttga atatagataa aataccagaa 3180 
aaaaataatg attacatgaa caaatgggca ggactcttag gtccaatttc aaatcacagt 3240 
tttggaggta gcttcagaac agcttcaaat aaggaaatca agctctctga acataacatt 3300 
aagaagagca aaatgttctt caaagatatt gaagaacaat atcctactag tttagcttgt 3360 
gttgaaattg taaatacctt ggcattagat aatcaaaaga aactgagcaa gcctcagtca 3420 
attaatactg tatctgcaca tttacagagt agtgtagttg tttctgattg taaaaatagt 3480 
catataaccc ctcagatgtt attttccaag caggatttta attcaaacca taatttaaca 3540 
cctagccaaa aggcagaaat tacagaactt tctactatat tagaagaatc aggaagtcag 3600 
tttgaattta ctcagtttag aaaaccaagc tacatattgc agaagagtac atttgaagtg 3660 
cctgaaaacc agatgactat cttaaagacc acttctgagg aatgcagaga tgctgatctt 3720 
catgtcataa tgaatgcccc atcgattggt caggtagaca gcagcaagca atttgaaggt 3780 
acagttgaaa ttaaacggaa gtttgctggc ctgttgaaaa atgactgtaa caaaagtgct 3840 
tctggttatt taacagatga aaatgaagtg gggtttaggg gcttttattc tgctcatggc 3900 
acaaaactga atgtttctac tgaagctctg caaaaagctg tgaaactgtt tagtgatatt 3960 
gagaatatta gtgaggaaac ttctgcagag gtacatccaa taagtttatc ttcaagtaaa 4020 
tgtcatgatt ctgttgtttc aatgtttaag atagaaaatc ataatgataa aactgtaagt 4080 
gaaaaaaata ataaatgcca actgatatta caaaataata ttgaaatgac tactggcact 4140 
tttgttgaag aaattactga aaattacaag agaaatactg aaaatgaaga taacaaatat 4200 
actgctgcca gtagaaattc tcataactta gaatttgatg gcagtgattc aagtaaaaat 4260 
gatactgttt gtattcataa agatgaaacg gacttgctat ttactgatca gcacaacata 4320 
tgtcttaaat tatctggcca gtttatgaag gagggaaaca ctcagattaa agaagatttg 4380 
tcagatttaa cttttttgga agttgcgaaa gctcaagaag catgtcatgg taatacttca 4440 
aataaagaac agttaactgc tactaaaacg gagcaaaata taaaagattt tgagacttct 4500 
gatacatttt ttcagactgc aagtgggaaa aatattagtg tcgccaaaga gtcatttaat 4560 
aaaattgtaa atttctttga tcagaaacca gaagaattgc ataacttttc cttaaattct 4620 
gaattacatt ctgacataag aaagaacaaa atggacattc taagttatga ggaaacagac 4680 
atagttaaac acaaaatact gaaagaaagt gtcccagttg gtactggaaa tcaactagtg 4740 
accttccagg gacaacccga acgtgatgaa aagatcaaag aacctactct gttgggtttt 4800 
catacagcta gcgggaaaaa agttaaaatt gcaaaggaat ctttggacaa agtgaaaaac 4860 
ctttttgatg aaaaagagca aggtactagt gaaatcacca gttttagcca tcaatgggca 4920 
aagaccctaa agtacagaga ggcctgtaaa gaccttgaat tagcatgtga gaccattgag 4980 
atcacagctg ccccaaagtg taaagaaatg cagaattctc tcaataatga taaaaacctt 5040 
gtttctattg agactgtggt gccacctaag ctcttaagtg ataatttatg tagacaaact 5100 
gaaaatctca aaacatcaaa aagtatcttt ttgaaagtta aagtacatga aaatgtagaa 5160 
aaagaaacag caaaaagtcc tgcaacttgt tacacaaatc agtcccctta ttcagtcatt 5220 
gaaaattcag ccttagcttt ttacacaagt tgtagtagaa aaacttctgt gagtcagact 5280 
tcattacttg aagcaaaaaa atggcttaga gaaggaatat ttgatggtca accagaaaga 5340 
ataaatactg cagattatgt aggaaattat ttgtatgaaa ataattcaaa cagtactata 5400 
gctgaaaatg acaaaaatca tctctccgaa aaacaagata cttatttaag taacagtagc 5460 
atgtctaaca gctattccta ccattctgat gaggtatata atgattcagg atatctctca 5520 
aaaaataaac ttgattctgg tattgagcca gtattgaaga atgttgaaga tcaaaaaaac 5580 
actagttttt ccaaagtaat atccaatgta aaagatgcaa atgcataccc acaaactgta 5640 
aatgaagata tttgcgttga ggaacttgtg actagctctt caccctgcaa aaataaaaat 5700 
gcagccatta aattgtccat atctaatagt aataattttg aggtagggcc acctgcattt 5760 
aggatagcca gtggtaaaat cgtttgtgtt tcacatgaaa caattaaaaa agtgaaagac 5820 
atatttacag acagtttcag taaagtaatt aaggaaaaca acgagaataa atcaaaaatt 5880 
tgccaaacga aaattatggc aggttgttac gaggcattgg atgattcaga ggatattctt 5940 
cataactctc tagataatga tgaatgtagc acgcattcac ataaggtttt tgctgacatt 6000 
cagagtgaag aaattttaca acataaccaa aatatgtctg gattggagaa agtttctaaa 6060 
atatcacctt gtgatgttag tttggaaact tcagatatat gtaaatgtag tatagggaag 6120 
cttcataagt cagtctcatc tgcaaatact tgtgggattt ttagcacagc aagtggaaaa 6180 
tctgtccagg tatcagatgc ttcattacaa aacgcaagac aagtgttttc tgaaatagaa 6240 
gatagtacca agcaagtctt ttccaaagta ttgtttaaaa gtaacgaaca ttcagaccag 6300 
ctcacaagag aagaaaatac tgctatacgt actccagaac atttaatatc ccaaaaaggc 6360 
ttttcatata atgtggtaaa ttcatctgct ttctctggat ttagtacagc aagtggaaag 6420 
caagtttcca ttttagaaag ttccttacac aaagttaagg gagtgttaga ggaatttgat 6480 
ttaatcagaa ctgagcatag tcttcactat tcacctacgt ctagacaaaa tgtatcaaaa 6540 
atacttcctc gtgttgataa gagaaaccca gagcactgtg taaactcaga aatggaaaaa 6600 
acctgcagta aagaatttaa attatcaaat aacttaaatg ttgaaggtgg ttcttcagaa 6660 
aataatcact ctattaaagt ttctccatat ctctctcaat ttcaacaaga caaacaacag 6720 
ttggtattag gaaccaaagt ctcacttgtt gagaacattc atgttttggg aaaagaacag 6780 
gcttcaccta aaaacgtaaa aatggaaatt ggtaaaactg aaactttttc tgatgttcct 6840 
gtgaaaacaa atatagaagt ttgttctact tactccaaag attcagaaaa ctactttgaa 6900 
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acagaagcag 
ctgccaagtc 
tcaaattcaa 
atcaaaagaa 
aaggcttcaa 
gtttctttag 
cagaatccaa 
catctgactt 
gtttctgcta 
gtctttgttc 
aatattaact 
agtaaaaata 
gcagcagctg 
cagaatgcca 
ccacagccag 
gcagcagtag 
gtttctaaac 
gaagattatt 
ggatggctca 
tgtgacactc 
agatggatca 
agatgcctaa 
gatagaagca 
acacttgttc 
agcaataaaa 
tggtatgctg 
ctgacagttg 
tgtacacctc 
cctgctcgct 
ttatcatcgc 
gcatacccta 
agagaggaag 
ttattcacta 
ttaccatcac 
tatgaagcag 
cagttaagag 
cagttggaaa 
gatgtcacaa 
gttatactga 
agatacagaa 
atacagttag 
ttatttcaga 
tttcagccat 
acaggacttg 
ttttggatag 
ctccagtggc 
gtgttttctg 
actgttgaga 
catgcaaatg 
gctcaaatca 
tattatcaaa 
tcagcccaga 
tgcaaaaaga 
cccatttgta 
ggcaccaaat 
tttaaaaaat 
gcattgataa 
gtcagtgaat 
cgttgtacta 
gagaaaaata 
acaataaatt 
acacattagt 
tttgtgtatc 
. tgcatatctt 
ctcggtggct 
aggccaggag 
aaaaaaaagg 
ttacaatcaa 



tagaaattgc 
atgceacaca 
gaattggaaa 
acttattaaa 
aaagcactcc 
agccgattac 
attttaccgc 
tggaaaaatc 
caagaaatga 
caccttttaa 
tggaggaaaa 
agattaatga 
taactttcac 
gagatataca 
gcagtctgta 
gaggccaagt 
attgcataaa 
ttggtaagga 
taccctccaa 
caggtgtgga 
tatggaaact 
gcccagaaag 
gaagatcggc 
tctgtgtttc 
ctagtagtgc 
ttaaggccca 
gtcagaagat 
ttgaagcccc 
ggtataccaa 
ttttcagtga 
tacagtggat 
aaaaggaagc 
aaattcagga 
gtgcactaac 
tgaagaatgc 
ccttgaataa 
ttaggaaggc 
ccgtgtggaa 
gtatttggcg 
tttatcatct 
cagcgacaaa 
tttaccagcc 
cttgttctga 
_ cccctttcgt 
accttaatga 
gaccagaatc 
ctagtccaaa 
atattgacat 
atcccaagtg 
ttcctggtac 
gtcctttatc 
tgacttcaaa 
gaagagcctt 
catttgtttc 
acgaaacacc 
tcaatgaaat 
atacccaagc 
ccactaggac 
catctctgat 
agcaggacac 
attgacgctt 
acttatgttg 
gggcaaaaat 
aaaactaaat 
catgcctgta 
ttcaagacca 
ggaaaagaaa 
caaaatggtc 



39740-0001PCT.txt - 
taaagctttt atggaagatg atgaactgac agattctaaa 6960 
ttctcttttt acatgtcccg aaaatgagga aatggttttg 7020 
aaqaagagga gagcccctta tcttagtggg agaaccctca 7080 
tgaatttgac aggataatag aaaatcaaga aaaatcctta 7140 
agatggcaca ataaaagatc gaagattgtt tatgcat cat 7200 
ctqtgtaccc tttcgcacaa ctaaggaacg tcaagagata 7260 
acctggtcaa gaatttctgt ctaaatctca tttgtatgaa 7320 
ttcaagcaat ttagcagttt caggacatcc attttatcaa 7380 
aaaaatgaga cacttgatta ctacaggcag accaaccaaa 7440 
aactaaatca cattttcaca gagttgaaca gtgtgttagg 7500 
cagacaaaag caaaacattg atggacatgg ctctgatgat 7560 
caatgagatt catcagttta acaaaaaoaa ctccaatcaa 7620 
aaagtgtgaa gaagaacctt tagatttaat tacaagtctt 7680 
ggatatgcga attaagaaga aacaaaggca acgcgtcttt 7740 
tcttgclala acatccactc tgcctcgaat ctctetgaaa 7800 
tccctctgcg tgttctcata aacagctgta tacgtatggc 7860 
aattaacagc aaaaatgcag agtcttttca gtttcacact 7920 
aagtttatgg actggaaaag gaatacagtt ggctgatggt 7980 
tgatggaaag gctggaaaag aagaatttta tagggctctg 8040 
tccaiigctt atttctagaa tttgggttta taatcactat 8100 
ggcagctatg gaatgtgcct ttcctaagga atttgctaat 8160 
ggtgcttctt caactaaaat acagatatga tacggaaatt 8220 
tataaaaaag ataatggaaa gggatgacac agctgcaaaa 8280 
tgacataatt tcattgagcg caaatatatc tgaaacttct 8340 
agatacccaa aaagtggcca ttattgaact tacagatggg 8400 
gttagatcct cccctcttag ctgtcttaaa gaatggcaga 8460 
tattcttcat ggagcagaac tggtgggctc tcctgatgcc 8520 
agaatctctt atgttaaaga tttctgctaa cagtactcgg 8580 
acttggattc tttcctgacc ctagaccttt tcctctgccc 8640 
tggaggaaat gttggttgtg ttgatgtaat tattcaaaga 8700 
ggigalgaca tcatctggat tatacatatt tcgcaatgaa 8760 
agcaaaatat gtggaggccc aacaaaagag actagaagcc 8820 
ggaatttgaa gaacatgaag aaaacacaac aaaaccatat 8»»u 
aagacagcaa gttcgtgctt tgcaagaugg tgcagagctt 8940 
agcagaccca gcttaccttg agggttattt cagtgaagag 9000 
tcacaggcaa atgttgaatg ataagaaaca agctcagatc 9060 
catggaatct gctgaacaaa aggaacaagg tttatcaagg 9120 
qttgcgtatt gtaagctatt caaaaaaaga aaaagattca 9180 
tccatcatca gatttatatt ctctgttaac agaaggaaag 9240 
tgcaacttca aaatctaaaa gtaaatctga aagagctaac 9300 
aaaaactcag tatcaacaac taccggtttc agatgaaatt 9360 
acgggagccc cttcacttca gcaaattttt agatccagac 9420 
ggtggaccta ataggatttg tcgtttctgt tgtgaaaaaa 9480 
ctatttgtca gacgaatgtt acaatttact ggcaataaag 9540 
qgacattatt aagcctcata tgttaattgc tgcaagcaac 9600 
caaatcaggc cttcttactt tatttgctgg agatttttct 9660 
agagggccac tttcaagaga cattcaacaa aatgaaaaat 9720 
actttgcaat gaagcagaaa acaagcttat gcatatactg 9780 
qtccacccca actaaagact gtacttcagg gccgtacact 9840 
iggaaacaag cttctgatgt cttctcctaa ttgtgagata 9900 
actttgtatg gccaaaagga agtctgtttc cacacctgtc 9960 
gtcttgtaaa ggggagaaag agattgatga ccaaaagaac 10020 
ggatttcttg agtagactgc ctttacctcc acctgttagt 10080 
tccggctgca cagaaggcat ttcagccacc aaggagttgt 10140 
catlaagaaa aalgaactga attctcctca gatgactcca 10200 
ttctcttttg gaaagtaatt caatagctga cgaagaactt 10260 
tcttttgtct ggttcaacag gagaaaaaca atttatatct 10320 
tgctccEacc agttcagaag attatctcag actgaaacga 10380 
ciaagaacag gagagttccc aggccagtac 99aagaatgt 10440 
aattacaact aaaaaatata tctaagcatt tgcaaaggcg 10500 
aacctttcca gtttataaga ctggaatata atttcaaacc 
cacaatgaga aaagaaatta gtttcaaatt tacctcagcg 
cgttttgccc gattccgtat tggtatactt ttgcttcagt 



10560 
10620 
10680 



gtaatttatt lactaatcaa gaaaaacatc tttggctgag 10740 
Itcccaacac tttgagaagc tgaggtggga ggagtgcttg 10800 
qcctgggcaa catagggaga cccccatctt tacgaagaaa 10860 
atcttttaaa tctttggatt tgatcactac aagtattatt 10920 
atccaaactc aaacttgaga aaatatcttg ctttcaaatt 10980 
gacacta 10987 



<210> 291 

<211> 1552 

<212> DNA 

<213> Homo sapiens 



<400> 291 

gcccgtacac accgtgtgct gggacacccc acagtcagcc gcatggctcc cctgtgcccc 60 
agcccctggc tccctctgtt gatcccggcc cetgctccag gcctcactgt gcaactgctg 120 
ctgtcactgc tgcttctgat gcctgtccat ccccagaggt tgccccggat gcaggaggat 180 
tcccccttgg gaggaggctc ttctggggaa gatgacccac tgggcgagga ggatctgccc 240 
agtgaagagg attcacccag agaggaggat ccacccggag aggaggatct acctggagag 300 
gaggatctac ctggagagga ggatctacct gaagttaagc ctaaatcaga agaagagggc 360 
tccctgaagt tagaggatct acctactgtt gaggctcctg gagatcctca agaaccccag 420 
aataatgccc acagggacaa agaaggggat gaccagagtc attggcgcta tggaggcgac 480 
ccgccctggc cccgggtgtc cecagcctgc gcgggccgct tccagtcccc ggtggatatc 540 
cgcccccagc tcgccgcctt ctgcccggcc ctgcgccccc tggaactcct gggcttccag 600 
ctcccgccgc tcccagaact gcgcctgcgc aacaatggcc acagtgtgca actgaccctg 660 
cctcctgggc tagagatggc tctgggtccc gggcgggagt accgggctct gcagctgcat 720 
ctgcactggg gggctgcagg tcgtccgggc tcggagcaca ctgtggaagg ccaccgtttc 780 
cctgccgaga tccacgtggt tcacctcagc accgcctttg ccagagttga cgaggccttg 840 
gggcgcccgg gaggcctggc cgtgttggcc gcctttctgg aggagggccc ggaagaaaac 900 
agtgcctatg agcagttgct gtctcgcttg gaagaaatcg ctgaggaagg ctcagagact 960 
caggtcccag gactggacat atctgcactc ctgccctctg acttcagccg ctacttccaa 1020 
tatgaggggt ctctgactac accgccctgt gcccagggtg tcatctggac tgtgtttaac 1080 
cagacagtga tgctgagtgc taagcagctc cacaccctct ctgacaccct gtggggacct 1140 
ggtgactctc ggctacagct gaacttccga gcgacgcagc ctttgaatgg gcgagtgatt 1200 
gaggcctcct tccctgctgg agtggacagc agtcctcggg ctgctgagcc agtccagctg 1260 
aattcctgcc tggctgctgg tgacatccta gccctggttt ttgigcctcct ttttgctgtc 1320 
accagcgtcg cgttccttgt gcagatgaga aggcagcaca gaaggggaac caaagggggt 1380 
gtgagctacc gcccagcaga ggtagccgag actggagcct agaggctgga tcttggagaa 1440 
tgtgagaagc cagccagagg catctgaggg ggagccggta actgtcctgt cctgctcatt 1500 
atgccacttc cttttaactg ccaagaaatt ttttaaaata aatatttata at 1552 



<210> 292 

<211> 1578 

<212> DNA 

<213> Homo sapiens 



<400> 292 

acgaacaggc caataaggag ggagcagtgc ggggtttaaa tctgaggcta ggctggctct 60 
tctcggcgtg ctgcggcgga acggctgttg gtttctgctg gttgtaggtc cttggctggt 120 
cgggcctccg gtgttctgct tctccccgct gagctgctgc ctggtgaaga ggaagccatg 180 
gcgctccgag tcaccaggaa ctcgaaaatt aatgctgaaa ataaggcgaa gatcaacatg 240 
gcaggcgcaa agcgcgttcc tacggcccct gctgcaacct ccaagcccgg actgaggcca 300 
agaacagctc ttggggacat tggtaacaaa gtcagtgaac aactgcaggc caaaatgcct 360 
atgaagaagg aagcaaaacc ttcagctact ggaaaagtca ttgataaaaa actaccaaaa 420 
cctcttgaaa aggtacctat gctggtgcca gtgccagtgt ctgagccagt gccagagcca 480 
gaacctgagc cagaacctga gcctgttaaa gaagaaaaac tttcgcctga gcctattttg 540 
gttgatactg cctctccaag cccaatggaa acatctggat gtgcccctgc agaagaagac 600 
ctgtgtcagg ctttctctga tgtaattctt gcagtaaatg atgtggatgc agaagatgga 660 
gctgatccaa acctttgtag tgaatatgtg aaagatattt atgcttatct gagacaactt 720 
gaggaagagc aagcagtcag accaaaatac ctactgggtc gggaagtcac tggaaacatg 780 
agagccatcc taattgactg gctagtacag gttcaaatga aattcaggtt gttgcaggag 840 
accatgtaca tgactgtctc cattattgat cggttcatgc agaataattg tgtgcccaag 900 
aagatgctgc agctggttgg tgtcactgcc atgtttattg caagcaaata tgaagaaatg 960 
taccctccag aaattggtga ctttgctttt gtgactgaca acacttatac taagcaccaa 1020 
atcagacaga tggaaatgaa gattctaaga gctttaaact ttggtctggg tcggcctcta 1080 
cctttgcact tccttcggag agcatctaag attggagagg ttgatgtcga gcaacatact 1140 
ttggccaaat acctgatgga actaactatg ttggactatg acatggtgca ctttcctcct 1200 
tctcaaattg cagcaggagc tttttgctta gcactgaaaa ttctggataa tggtgaatgg 1260 
acaccaactc tacaacatta cctgtcatat actgaagaat ctcttcttcc agttatgcag 1320 
cacctggcta agaatgtagt catggtaaat caaggactta caaagcacat gactgtcaag 1380 
aacaagtatg ccacatcgaa gcatgctaag atcagcactc taccacagct gaattctgca 1440 
ctagttcaag atttagccaa ggctgtggca aaggtgtaac ttgtaaactt gagttggagt 1500 
actatattta caaataaaat tggcaccatg tgccatctgt aaaaaaaaaa aaaaaaaaaa 1560 
aaaaaaaaaa aaaaaaaa 1578 

<210> 293 
<211> 3195 
<212> DNA 
<213> Homo sapiens 



<400> 293 
agaggcttcc ctggctggtg cctgagcccg gcgtccctcg ccccccgccc tccccgcatc 60 
cctctcctcc ctcgcgcctg gccctgtggc tcttcctccc tccctccttc cccccccccc 120 
cacccctcgc ccgctgcctc cctcggccca gccagctgtg ccggcgtttg ttggctgccc 180 
tgcgcccggc cctccagcca gccttctgcc ggccccgccg cgatggaggt gccccagccg 240 
gagcccgcgc caggctcggc tctcagtcca gcaggcgtgt gcggtggcgc ccagcgtccg 300 
ggccacctcc cgggcctcct gctgggatct catggcctcc tggggtcccc ggtgcgggcg 360 
gccgcttcct cgccggtcac caccctcacc cagaccatgc acgacctcgc cgggctcggc 420 
agccgcagcc gcctgacgca cctatccctg tctcgacggg catccgaatc ctccctgtcg 480 
tctgaatcct ccgaatcttc tgatgcaggt ctctgcatgg attcccccag ccctatggac 540 
ccccacatgg cggagcagac gtttgaacag gccatccagg cagccagccg gatcattcga 600 
aacgagcagt ttgccatcag acgcttccag tctatgccgg tgaggctgct gggccacagc 660 
cccgtgcttc ggaacatcac caactcccag gcgcccgacg gccggaggaa gagcgaggcg 720 
ggcagtggag ctgccagcag ctctggggaa gacaaggaga atgtgcgctt ctggaaggcc 780 
ggggtgggag ctctccggga agaggagggg gcatgctggg gtggttccct ggcatgtgag 840 
gaccctcctc tcccatcttg gctgcaggat ggatttgtct tcaagatgcc atggaagccc 900 
acacatccca gctccaccca tgctctggca gagtgggcca gccgcaggga agcctttgcc 960 
cagagaccca gctcggcccc cgacctgatg tgtctcagtc ctgaccggaa gatggaagtg 1020 
gaggagctca gccccctggc cctaggtcgc ttctctctga cccctgcaga gggggatact 1080 
gaggaagatg atggatttgt ggacatccta gagagtgact taaaggatga tgatgcagtt 1140 
cccccaggca tggagagtct cattagtgcc ccactggtca agaccttgga aaaggaagag 1200 
gaaaaggacc tcgtcatgta cagcaagtgc cagcggctct tccgctctcc gtccatgccc 1260 
tgcagcgtga tccggcccat cctcaagagg ctggagcggc cccaggacag ggacacgccc 1320 
gtgcagaata agcggaggcg gagcgtgacc cctcctgagg agcagcagga ggctgaggaa 1380 
cctaaagccc gcgtcctccg ctcaaaatca ctgtgtcacg atgagatcga gaacctcctg 1440 
gacagtgacc accgagagct gattggagat tactctaagg ccttcctcct acagacagta 1500 
gacggaaagc accaagacct caagtacatc tcaccagaaa cgatggtggc cctattgacg 1560 
ggcaagtrca gcaacatcgt ggataagttr gtgattgtag actgcagata cccctatgaa 1620 
tatgaaggcg ggcacatcaa gactgcggtg aacttgcccc tggaacgcga cgccgagagc 1680 
ttcctactga agagccccat cgcgccctgt agcctggaca agagagtcat cctcattttc 1740 
cactgtgaat tctcatctga gcgtgggccc cgcatgtgcc gtttcatcag ggaacgagac 1800 
cgtgctgtca acgactaccc cagcctctac taccctgaga tgtatatcct gaaaggcggc 1860 
tacaaggagt tcttccctca gcacccgaac ttctgtgaac cccaggacta ccggcccatg 1920 
aaccacgagg ccttcaagga tgagctaaag accttccgcc tcaagactcg cagctgggct 1980 
ggggagcgga gccggcggga gctctgtagc cggctgcagg accagtgagg ggcctgcgcc 2040 
agtcctgcta cctcccttgc ctttcgaggc ctgaagccag ctgccctatg ggcctgccgg 2100 
gctgagggcc tgctggaggc ctcaggtgct gtccatggga aagatggtgt ggtgtcctgc 2160 
ctgtctgccc cagcccagat tcccctgtgt catcccatca ttttccatat cctggtgccc 2220 
cccacccctg gaagagccca gtctgttgag ttagttaagt tgggttaata ccagcttaaa 2280 
ggcagtattt tgtgtcctcc aggagcttct tgtttccttg ttagggttaa cccttcatct 2340 
tcctgtgtcc tgaaacgctc ctttgtgtgt gtgtcagctg aggctgggga gagccgtggt 2400 
ccctgaggat gggtcagagc taaactcctt cctggcctga gagtcagctc tctgccctgt 2460 
gtacttcccg ggccagggct gcccctaatc tctgtaggaa ccgtggtatg tctgccatgt 2520 
tgcccctttc tcttttcccc tttcctgtcc caccatacga gcacctccag cctgaacaga 2580 
agctcttact ctttcctatt tcagtgttac ctgtgtgctt ggtctgtttg actttacgcc 2640 
catctcagga cacttccgta gactgtttag gttcccctgt caaatatcag ttacccactc 2700 
ggtcccagtt ttgttgcccc agaaagggat gttattatcc ttgggggctc ccagggcaag 2760 
ggttaaggcc tgaatcatga gcctgctgga agcccagccc ctactgctgt gaaccctggg 2820 
gcctgactgc tcagaacttg ctgctgtctt gttgcggatg gatggaaggt tggatggatg 2880 
ggtggatggc cgtggatggc cgtggatgcg cagtgccttg catacccaaa ccaggtggga 2940 
gcgttttgtt gagcatgaca cctgcagcag gaatatatgt gtgcctattt gtgtggacaa 3000 
aaatatttac acttagggtt tggagctatt caagaggaaa tgtcacagaa gcagctaaac 3060 
caaggactga gcaccctctg gattctgaat ctcaagatgg gggcagggct gtgcttgaag 3120 
gccctgctga gtcatctgtt agggccttgg ttcaataaag cactgagcaa gttgagaaaa 3180 
aaaaaaaaaa aaaaa 3195 



<210> 294 

<211> 3737 

<212> DNA 

<213> Homo sapiens 
<400> 294 
ggcgtccgcg cacacctccc cgcgccgccg ccgccaccgc ccgcactccg ccgcctctgc 60 
ccgcaaccgc tgagccatcc atgggggtcg cgggccgcaa ccgtcccggg gcggcctggg 120 
cggtgctgct gctgctgctg ctgctgccgc cactgctgct gctggcgggg gccgtcccgc 180 
cgggtcgggg ccgtgccgcg gggccgcagg aggatgtaga tgagtgtgcc caagggctag 240 
atgactgcca tgccgacgcc ctgtgtcaga acacacccac ctcctacaag tgctcctgca 300 
agcctggcta ccaaggggaa ggcaggcagt gtgaggacat cgatgaatgt ggaaatgagc 360 
tcaatggagg ctgtgtccat gactgtttga atattccagg caattatcgt tgcacttgtt 420 
ttgatggctt catgttggct catgacggtc ataattgtct tgatgtggac gagtgcctgg 480 
agaacaatgg cggctgccag catacctgtg tcaaegtcat ggggaxjctat gagtgctgct 540 
gcaaggaggg gtttttcctg agtgacaatc agcacacctg cattcaccgc tcggaagagg 600 
gcctgagctg catgaataag gatcacggct gtagtcacat ctgcaaggag gccccaaggg 660 
gcagcgtcgc ctgtgagtgc aggcctggtt ttgagctggc caagaaccag agagactgca 720 
tcttgacctg taaccatggg aacggtgggt gccagcactc ctgtgacgat acagccgatg 780 
gcccagagtg cagctgccat ccacagtaca agatgcacac agatgggagg agctgccttg 840 
agcgagagga cactgtcctg gaggtgacag agagcaacac cacatcagtg gtggatgggg 900 
ataaacgggt gaaacggcgg ctgctcatgg aaacgtgtgc tgtcaacaat ggaggctgtg 960 
accgcacctg taaggatact tcgacaggtg tccactgcag ttgtcctgtt ggattcactc 1020 
tccagttgga tgggaagaca tgtaaagata ttgatgagtg ccagacccgc aatggaggtt 1080 
gtgatcattt ctgcaaaaac atcgtgggca gttttgactg cggctgcaag aaaggattta 1140 
aattattaac agatgagaag tcttgccaag atgtggatga gtgctctttg gataggacct 1200 
gtgaccacag ctgcatcaac caccctggca catttgcttg tgcttgcaac cgagggtaca 1260 
ccctgtatgg cttcacccac tgtggagaca ccaatgagtg cagcatcaac aacggaggct 1320 
gtcagcaggt ctgtgtgaac acagtgggca gctatgaatg ccagtgccac cctgggtaca 1380 
agctccactg gaataaaaaa gactgtgtgg aagtgaaggg gctcctgccc acaagtgtgt 1440 
caccccgtgt gtccctgcac tgcggtaaga gtggtggagg agacgggtgc ttcctcagat 1500 
gtcactctgg cattcacctc tcttcagatg tcaccaccat caggacaagt gtaaccttta 1560 
agctaaatga aggcaagtgt agtttgaaaa atgctgagct gtttcccgag ggtctgcgac 1620 
cagcactacc agagaagcac agctcagtaa aagagagctt ccgctacgta aaccttacat 1680 
gcagctctgg caagcaagtc ccaggagccc ctggccgacc aagcacccct aaggaaatgt 1740 
ttatcactgt tgagtttgag cttgaaacta accaaaagga ggtgacagct tcttgtgacc 1800 
tgagctgcat cgtaaagcga accgagaagc ggctccgtaa agccatccgc acgctcagaa 1860 
aggccgtcca cagggagcag tttcacctcc agctctcagg catgaacctc gacgtggcta 1920 
aaaagcctcc cagaacatct gaacgccagg cagagtcctg tggagtgggc cagggtcatg 1980 
cagaaaacca atgtgtcagt tgcagggctg ggacctatta tgatggagca cgagaacgct 2040 
gcattttatg tccaaatgga accttccaaa atgaggaagg acaaatgact tgtgaaccat 2100 
gcccaagacc aggaaattct ggggccctga agaccccaga agcttggaat atgtctgaat 2160 
gtggaggtct gtgtcaacct ggtgaatatt ctgcagatgg ctttgcacct tgccagctct 2220 
gtgccctggg cacgttccag cctgaagctg gtcgaacttc ctgcttcccc tgtggaggag 2280 
gccttgccac caaacatcag ggagctactn cctttcagga ctgtgaaacc agagttcaat 2340 
gttcacctgg acatttctac aacaccacca ctcaccgatg tattcgttgc ccagtgggaa 2400 
cataccagcc tgaatttgga aaaaataatt gtgtttcttg cccaggaaat actacgactg 2460 
actttgatgg ctccacaaac ataacccagt gtaaaaacag aagatgtgga ggggagctgg 2520 
gagatttcac tgggtacatt gaatecccaa actacccagg caattaccca gccaacaccg 2580 
agtgtacgtg gaccatcaac ccacccccca agcgccgcat cctgatcgtg gtccctgaga 2640 
tcttcctgcc catagaggac gactgtgggg actatctggt gatgcggaaa acctcttcat 2700 
ccaattctgt gacaacatat gaaacctgcc agacctacga acgccccatc gccttcacct 2760 
ccaggtcaaa gaagctgtgg attcagttca agtccaatga agggaacagc gctagagggt 2820 
tccaggtccc atacgtgaca tatgatgagg actaccagga actcattgaa gacatagttc 2880 
gagatggcag gctctatgca tctgagaacc atcaggaaat acttaaggat aagaaactta 2940 
tcaaggctct gtttgatgtc ctggcccatc cccagaacta tttcaagtac acagcccagg 3000 
agtcccgaga gatgtttcca agatcgttca tccgattgct acgttccaaa gtgtccaggt 3060 
ttttgagacc ttacaaatga ctcagcccac gtgccactca atacaaatgt tctgctatag 3120 
ggttggtggg acagagctgt cttccttctg catgtcagca cagtcgggta ttgctgcctc 3180 
ccgtatcagt gactcattag agttcaattt ttatagataa tacagatatt ttggtaaatt 3240 
gaacttggtt tttctttccc agcatcgtgg atgtagactg agaatggctt tgagtggcat 3300 
cagcttctca ctgctgtggg cggatgtctt ggatagatca cgggctggct gagctggact 3360 
trggtcagcc taggtgagac tcacctgtcc ttctggggtc ttactcctcc tcaaggagtc 3420 
tgtagtggaa aggaggccac agaataagct gcttattctg aaacttcagc ttcctctagc 3480 
ccggccctct ctaagggagc cctctgcact cgtgtgcagg ctctgaccag gcagaacagg 3540 
caagagggga gggaaggaga cccctgcagg ctccctccac ccaccttgag acctgggagg 3600 
actcagtttc tccacagcct tctccagcct gtgtgataca agtttgatcc caggaacttg 3660 
agttctaagc agtgctcgtg aaaaaaaaaa gcagaaagaa ttagaaataa ataaaaacta 3720 
agcacttctg gagacat 3737 

<210> 295 
<211> 2042 
<210> 296 
<211> 2547 
<212> DNA 

<213> Homo sapiens 
<400> 296 



subbbsi 

cttctgggaa cccctgtcgg "aaaacccc cacacccgga gcaaag^ yy 4gQ 

8SSSS8S SSgSgS f Ss8 SS^S & 

BBBMM 

l£?.?t|ct gi«a???« gSSSS ??XK.S? SSSSc S^PSJg 1260 
SSS&S aa 9 gaatacgcVccttt g aataacaatx 1320 
ctgaaattgc actgtcacta agcagaaata aagtttttgt cttttttgat ctgactgtag 1380 
atgatcagtc agtttatcct aaggcattaa gagatgaata catcatgtca aaaactcttg 1440 
gaagtggtgc ctgtggagag gtaaagctgg ctttcgagag gaaaacatgt aagaaagtag 1500 
ccataaagat catcagcaaa aggaagtttg ctattggttc agcaagagag gcagacccag 1560 
ctctcaatgt tgaaacagaa atagaaattt tgaaaaagct aaatcatcct tgcatcatca 1620 
agattaaaaa cttttttgat gcagaagatt attatattgt tttggaattg atggaagggg 1680 
gagagctgtt tgacaaagtg gtggggaata aacgcctgaa agaagctacc tgcaagctct 1740 
atttttacca gatgctcttg gctgtgcagt accttcatga aaacggtatt atacaccgtg 1800 
acttaaagcc agagaatgtt ttactgtcat ctcaagaaga ggactgtctt ataaagatta 1860 
ctqattttgg gcactccaag attttgggag agacctctet catgagaacc ttatgtggaa 1920 
cccccaccta cttggcgcct gaagttcttg tttctgttgg gactgctggg tataaccgtg 1980 
ctgtggactg ctggagttta ggagttattc tttttatctg ccttagtggg tatccacctt 2040 
tctctgagca taggactcaa gtgtcactga aggatcagat caccagtgga aaatacaact 2100 
tcattcctga agtctgggca gaagtctcag agaaagctct ggaccttgtc aagaagttgt 2160 
tggtagtgga tccaaaggca cgttttacga cagaagaagc cttaagacac ccgtggcttc 2220 
aggatgaaga catgaagaga aagtttcaag atcttctgtc tgaggaaaat gaatccacag 2280 
ctctacccca ggttctagcc cagccttcta ctagtcgaaa gcggccccgt gaaggggaag 2340 
ccgagggtgc cgagaccaca aagcgcccag ctgtgtgtgc tgctgtgttg tgaactccgt 2400 
ggtttgaaca cgaaagaaat gtaccttctt tcactctgtc atctttcttt tctttgagtc 2460 
tgttttttta tagtttgtat tttaattatg ggaataattg ctttttcaca gtcactgatg 2520 
tacaattaaa aacctgatgg aacctgg 2547 

<210> 297 
<211> 2768 
<212> DNA 

<213> Homo sapiens 

<400> 297 

cactgctgtg cagggcagga aagctccatg cacatagccc agcaaagagc aacacagagc 60 
tgaaaggaag actcagagga gagagataag taaggaaagt agtgatggct ctcatcccag 120 
acttggccat ggaaacctgg cttctcctgg ctgtcagcct ggtgctcctc tatctatatg 180 
gaacccattc acatggactt tttaagaagc ttggaattcc agggcccaca cctctgcctt 240 
ttttgggaaa tattttgtcc taccataagg gcttttgtat gtttgacatg gaatgtcata 300 
aaaagtatgg aaaagtgtgg ggcttttatg atggtcaaca gcctgtgctg gctatcacag 360 
atcctgacat gatcaaaaca gtgctagtga aagaatgtta ttctgtcttc acaaaccgga 420 
ggccttttgg tccagtggga tttatgaaaa gtgccatctc tatagctgag gatgaagaat 480 
ggaagagatt acgatcattg ctgtctccaa ccttcaccag tggaaaactc aaggagatgg 540 
tccctatcat tgcccagtat ggagatgtgt tggtgagaaa tctgaggcgg gaagcagaga 600 
caggcaagcc tgtcaccttg aaagacgtct ttggggccta cagcatggat gtgatcacta 660 
gcacatcatt tggagtgaac atcgactctc tcaacaatcc acaagacccc tttgtggaaa 720 
acaccaagaa gcttttaaga tttgattttt tggatccatt ctttctctca ataacagtct 780 
ttccattcct catcccaatt cttgaagtat taaatatctg tgtgtttcca agagaagtta 840 
caaatttttt aagaaaatct gtaaaaagga tgaaagaaag tcgcctcgaa gatacacaaa 900 
agcaccgagt ggatttcctt cagctgatga ttgactctca gaattcaaaa gaaactgagt 960 
cccacaaagc tctgtccgat ctggagctcg tggcccaatc aattatcttt atttttgctg 1020 
gctatgaaac cacgagcagt gttctctcct tcattatgta tgaactggcc actcaccctg 1080 
atgtccagca gaaactgcag gaggaaattg atgcagtttt acccaataag gcaccaccca 1140 
cctatgatac tgtgctacag atggagtatc ttgacatggt ggtgaatgaa acgctcagat 1200 
tattcccaat tgctatgaga cttgagaggg tctgcaaaaa agatgttgag atcaatggga 1260 
tgttcattcc caaaggggtg gtggtgatga ttccaagcta tgctcttcac cgtgacccaa 1320 
agtactggac agagcctgag aagttcctcc ctgaaagatt cagcaagaag aacaaggaca 1380 
acatagatcc ttacatatac acaccctttg gaagtggacc cagaaactgc attggcatga 1440 
ggtttgctct catgaacatg aaacttgctc taatcagagt ccttcagaac ttctccttca 1500 
aaccttgtaa agaaacacag atccccctga aattaagctt aggaggactt cttcaaccag 1560 
aaaaacccgt tgttctaaag gttgagtcaa gggatggcac cgtaagtgga gcctgaattt 1620 
tcctaaggac ttctgctttg ctcttcaaga aatctgtgcc tgagaacacc agagacctca 1680 
aattactttg tgaatagaac tctgaaatga agatgggctt catccaatgg actgcataaa 1740 
taaccgggga ttctgtacat gcattgagct ctctcattgt ctgtgtagag tgttatactt 1800 
gggaatataa aggaggtgac caaatcagtg tgaggaggta gatttggctc ctctgcttct 1860 
cacgggacta tttccaccac ccccagttag caccattaac tcctcctgag ctctgataag 1920 
agaatcaaca tttctcaata atttcctcca caaattatta atgaaaataa gaattatttt 1980 
gatggctcta acaatgacat ttatatcaca tgttttctct ggagtattct ataagtttta 2040 
tgttaaatca ataaagacca ctttacaaaa gtattatcag atgctttcct gcacattaag 2100 
gagaaatcta tagaactgaa tgagaaccaa caagtaaata tttttggtca ttgtaatcac 2160 
tgttggcgtg gggcctttgt cagaactaga atttgattat taacataggt gaaagttaat 2220 
ccactgtgac tttgcccatt gtttagaaag aatattcata gtttaattat gccttttttg 2280 
atcaggcaca gtggctcacg cctgtaatcc tagcagtttg ggaggctgag ccgggtggat 2340 
cgcctgaggt caggagttca agacaagcct ggcctacatg gttgaaaccc catctctact 2400 
aaaaatacac aaattagcta ggcatggtgg actcgcctgt aatctcacta cacaggaggc 2460 
■XKS2XS S!t3cSS iacctgggag gcggatgttg aagtgagctg agattgcacc 2520 
a??o?actcc aqtctgggtg agagtgagac tcagtcttaa aaaaatatgc ctttttgaag 2580 
rarnrlcatt ttgtaiclaa gaactgaigc tcttattata ttattagttt tgatttaatg 2640 
t?r?clSccc atctcctttc itatttctgg gagacagaaa acatgtttcc ctacacctct 2700 
SSScca? Stcaacacc caactgtctc gatgcaatga acacttaata aaaaacagtc 2760 



gattggtc 2768 

<210> 298 
<211> 1358 
<212> DNA 
<213> Homo sapiens 

<400> 298 



no?Stccaca cqctgcacaa tggcggctct gaagagttgg ctgtcgcgca gcgnaacttc 60 
??^rclaa ticaaacaqt gtttgtgtgt tcctgttgtg gctaacttta agaagcggtg 120 
>«cSaqaa ttgataagac catggcacaa aactgtgacg attggctttg gagtaaccct 180 
SSSSqtt cctattgcac agaiatcaga gcctcattcc cttagtagtg aagcattgat 240 
gtgxgcgg^t ^V"*;^ • r ? araaa1 -| a cacctctacc tttctctctc agaccacata 300 
££33tt SSESSta SS2t!Xc .SSSrtgtt tataccttaa cttctcttta 360 
SuaSaSt-afcSStttS ttgggaaaat gaattcagag gaggaagatg aagtgtggca 420 
nnroa?cata qqagccagag ctgigatgac ttcaaaacac caagagtact tgaagctgga 480 
SKSStS Itgictgcag ttggtctttc agagatggca gcagaagctg catatcaaae 540 
rnalaclalt caggcctcta taaccgccag gaatcacatt cagctggtga aactgcaggt 600 
-2£ Sa caccaactct cccggaaagc igaaaecaag ctggcagaag cacagataga 660 

h isss fass sssss ?sssss iraj 1 

«acctacct qggtcagaga gaggagtcct ttttgtcatg cccttaagtt cagcaactgt 1080 
ttaacrtqtt ttcagtctta tttacgtcgt caaaaatgat ttagtacttg ttccctctgt 1140 
taaaatacca gttgtggcag ggggagggga acctgtccag tttgtacgat ttctttgtat 1200 
itatttctS' ?gtgt?ctctgatctgcccc cactgtcctg tgaggacagc tgaggccaag 1260 
gagSaaaal c?tattacta ctaagagaag gggtgcagag tgtttacctg gtgctctcaa 1320 
caggacttaa catcaacagg acttaacaca gaaaaaaa 1358 



<210> 299 
<211> 4407 
<212> DNA 

<213> Homo sapiens 
<400> 299 



tlrrnxlrca cactccooct gctgtcactt ggctctctgg ctggagcttg aggacgcaag 60 
SSSiSSt caSggcaga Etcgagactg taggcactgc catggcccct gtgctcagta 120 
annlrtcaS qqacatcgag agtitcctgg ctttaaatcc tcgaacacaa actcatgcaa 180 
rtSSSttc cSScggcc algaaattag acaagaaaca ttggaaaaga aatcctgata 240 
aaaactactt taattgtgag aagctggaga ataattttga tgacatcaag cacacgactc 300 
?taotaS aqqaqctctc cgigaigcia tgagatgcct gaaatgtgca gatgccccgt 360 
o^inataal ?tot?caact aitcttgata ttaaatcatt catcacaagt attgcaaaca 420 
laaartat?! SgagSgct aagatgatat tttctgacaa cccacttggt ctgacttgtg 480 
aaatqSatg t?calcctct gatctatgtg taggtggatg caatttatat gccactgaag 540 
?^ni?;raf raatattaat qqattqcaqc aatttgctac tgaggtattc aaagcaatga 600 
a?Itccca?a JXSS2E ?c?tcg"gc ctcccccaga aaaaatgtct gaagcctatt 660 
??gSaaga? tgctcttttt ggtgctgggc ctgcaagtat aagttgtgct tcctttttgg 720 
ricaartaaa atactctgac atcactatat ttgaaaaaca agaatatgtt ggtggtttaa 780 
SarriSo! lattcctcag ttccggctgc cgtatgatgt agtgaatttt gagattgagc 840 
?SSaaqqa ??«ggtg?i aagataattt gcggtiaalg cctttcagtg aatgaaatga 900 
SStJaca? tttgilagaa aaaggctaca Eagctgcttt cattggaata ggtttgccag 960 
aac"latta agatgccitc ttccaaggcc tgacgcagga ccaggggttt tatacatcca 1020 
SaSactttS g£cacttgta gccaaaggca gtaaagcagg aatgtgcgcc tgtcactctc 1080 
?attncratc aatacgggga gtcgtgattg tacttggagc tggagacact gccttcgact 1140 
a?qcaaca« tgc?ctacgt tgtggagctc gccgagtgtt catcgtcttc agaaaaggct 1200 
?^ar?aa?at aaqaqctgtc cctgaggaga tggagcttgc taaggaagaa aagtgtgaat 1260 
««q??a« c?tgtcccca cgglaggtta tagtaaaagg tgggagaatt gttgctatgc 1320 
aa«tqt"g gacigagcaa gatgaalctg gaaaatggaa tgaagatgaa gatcagatgg 1380 
??Stctgaa Igccgatgtg gtcatcagtg cctttggttc agttctgagt gatcctaaag 1440 
taaaagaagc cttgagccct ataaaattta acagatgggg tctcccagaa gtagatccag 1500 
aaactatgca aactagtgaa gcatgggtat ttgcaggtgg tgatgtcgtt ggtttggcta 1560 
acactacagt ggaatcggtg aatgatggaa agcaagcttc ttggtacatt cacaaatacg 1620 
tacagtcaca atatggagct tccgtttctg ccaagcctga actacccctc ttttacactc 1680 
ctattgatct ggtggacatt agtgtagaaa tggccggatt gaagtttata aatccttttg 1740 
gtcttgctag cgcaactcca gccaccagca catcaatgat tcgaagagct tttgaagctg 1800 
qatggggttt tgccctcacc aaaactttct ctcttgataa ggacattgtg acaaatgttt 1860 
cccccagaat catccgggga accacctctg gccccatgta tggccctgga caaagctcct 1920 
ttctgaatat tgagctcatc agtgagaaaa cggctgcata ttggtgtcaa agtgtcactg 1980 
aactaaaggc tgacttccca gacaacattg tgattgctag cattatgtgc agttacaata 2040 
aaaatgactg gacggaactt gccaagaagt ctgaggattc tggagcagat gccctggagt 2100 
taaatttatc atgtccacat ggcatgggag aaagaggaat gggcctggcc tgtgggcagg 2160 
atccagagct ggtgcggaac atctgccgct gggttaggca agctgttcag attccttttt 2220 
ttgccaagct gaccccaaat gtcactgata ttgtgagcat cgcaagagct gcaaaggaag 2280 
gtggtgccaa tggcgttaca gccaccaaca ctgtctcagg tctgatggga ttaaaatctg 2340 
atggcacacc ttggccagca gtggggattg caaagcgaac tacatatgga ggagtgtctg 2400 
ggacagcaat cagacctatt gctttgagag ctgtgacctc cattgctcgt gctctgcctg 2460 
gatttcccat tttggctact ggtggaattg actctgctga aagtggtctt cagtttctcc 2520 
atagtggtgc ttccgtcctc caggtatgca gtgccattca gaatcaggat ttcactgtga 2580 
tcgaagacta ctgcactggc ctcaaagccc tgctttatct gaaaagcatt gaagaactac 2640 
aagactggga tggacagagt ccagctactg tgagtcacca gaaagggaaa ccagttccac 2700 
gtatagctga actcatggac aagaaactgc caagttttgg accttatctg gaacagcgca 2760 
agaaaatcat agcagaaaac aagattagac tgaaagaaca aaatgtagct ttttcaccac 2820 
ttaagagaag ctgttttatc cccaaaaggc ctattcctac catcaaggat gtaataggaa 2880 
aagcactgca gtaccttgga acatttggtg aattgagcaa cgtagagcaa gttgtggcta 2940 
tgattgatga agaaatgtgt atcaacigtg gtaaatgcta catgacctgt aatgattctg 3000 
gctaccaggc tatacagttt gatecagaaa cccacctgcc caccataacc gacacttgta 3060 
caggctgtac tctgtgtctc agtgtttgcc ctattgtcga ctgcatcaaa atggtttcca 3120 
ggacaacacc ttatgaacca aagagaggcg tacccttatc tgtgaatccg gtgtgttaag 3180 
gtgatttgtg aaacagttgc tgtgaacttt catgtcacct acatatgctg atctcttaaa 3240 
atcatgatcc ttgtgttcag ctctttccaa attaaaacaa atatacattt tctaaataaa 3300 
aatatgtaat ttcaaaatac atttgtaagt gtaaaaaatg tctcatgtca atgaccattc 3360 
aattagtggc ataaaataga ataattcttt tctgaggata gtagttaaat aactgtgtgg 3420 
cagttaattg gatgttcact gccagttgtc ttatgtgaaa aattaacttt ttgtgtggca 3480 
attagtgtga cagtttccaa attgccctat gctgtgctcc atatttgatt tctaattgta 3540 
agtgaaatta agcattttga aacaaagtac tctttaacat acaagaaaat gtatccaagg 3600 
aaacatttta tcaataaaaa ttacctttaa ttttaatgct gtttctaaga aaatgtagtt 3660 
agctccataa agtacaaatg aagaaagtca aaaattattt gctatggcag gataagaaag 3720 
cctaaaattg agtttgtgga ctttattaag taaaatcccc ttcgctgaaa ttgcttattt 3780 
ttggtgttgg atagaggata gggagaatat ttactaacta aataccattc actactcatg 3840 
cgtgagatgg gtgtacaaac tcatcctctt ttaatggcat ttctctttaa actatgttcc 3900 
taaccaaatg agatgatagg atagatcctg gttaccactc ttttactgtg cacatatggg 3960 
ccccggaatt ctttaatagt caccttcatg attatagcaa ctaatgtttg aacaaagctc 4020 
aaagtatgca atgcttcatt attcaagaat gaaaaatata atgttgataa tatatattaa 4080 
gtgtgccaaa tcagtttgac tactctctgt tttagtgttt atgtttaaaa gaaatatatt 4140 
ttttgttatt attagataat atttttgtat ttctctattt tcataatcag taaatagtgt 4200 
catataaact catttatctc ctcttcatgg catcttcaat atgaatctat aagtagtaaa 4260 
tcagaaagta acaatctatg gcttatttct atgacaaatt caagagctag aaaaataaaa 4320 
tgtttcatta tgcactttta gaaatgcata tttgccacaa aacctgtatt actgaataat 4380 
atcaaataaa atatcataaa gcatttt 4407 

<210> 300 
<211> 5532 
<212> DNA 

<213> Homo sapiens 
<400> 300 

gccgcgctgc gccggagtcc cgagctagcc ccggcgccgc cgccgcccag accggacgac 60 

aggccacctc gtcggcgtcc gcccgagtcc ccgcctcgcc gccaacgcca caaccaccgc 120 

gcacggcccc ctgactccgt ccagtattga tcgggagagc cggagcgagc tcttcgggga 180 

gcagcgatgc gaccctccgg gacggccggg gcagcgctcc tggcgctgct ggctgcgctc 240 

tgcccggcga gtcgggctct ggaggaaaag aaagtttgcc aaggcacgag taacaagctc 300 

acgcagttgg gcacttttga agatcatttt ctcagcctcc agaggatgtt caataactgt 360 

gaggtggtcc ttgggaattt ggaaattacc tatgtgcaga ggaattatga tctttccttc 420 

ttaaagacca tccaggaggt ggctggttat gtcctcattg ccctcaacac agtggagcga 480 

attcctttgg aaaacctgca gatcatcaga ggaaatatgt actacgaaaa ttcctatgcc 540 

ttagcagtct tatctaacta tgatgcaaat aaaaccggac tgaaggagct gcccatgaga 600 

aatttacagg aaatcctgca tggcgccgtg cggttcagca acaaccctgc cctgtgcaac 660 
gtggagagca tccagtggcg ggacatagtc agcagtgact ttctcagcaa catgtcgatg 720 
gacttccaga accacctggg cadctgccaa aagtgtgatc caagctgtcc caatgggagc 780 
tgctggggtg caggagagga gaactgccag aaactgacca aaatcatctg tgcccagcag 840 
tgctccgggc gctgccgtgg caagtccccc agtgactgct gccacaacca gtgtgctgca 900 
ggctgcacag gcccccggga gaqcgactgc ctggtctgcc gcaaattccg agacgaagcc 960 
acgtgcaagg acacctgccc cccactcatg ctctacaacc ccaccacgta ccagatggat 1020 
gtgaaccccg agggcaaata cagctttggt gccacctgeg tgaagaagtg tccccgtaat 1080 
tatgtggtga cagatcacgg ctcgtgcgtc cgagcctgtg gggccgacag ctatgagatg 1140 
gaggaagacg gcgtccgcaa qtgtaagaag tgcgaagggc cttgccgcaa agtgtgtaac 1200 
ggaataggta ttggtgaatt taaagactca ctctccataa atgctacgaa tattaaacac 1260 
ttcaaaaact gcacctccat cagtggcgat ctccacatcc tgccggtggc atttaggggt 1320 
gactccttca cacatactcc tcctctggat ccacaggaac tggatattct gaaaaccgta 1380 
aaggaaatca cagggttttt gctgattcag gcttggcctg aaaacaggac ggacctccat 1440 
gcctttgaga acctagaaat catacgcggc aggaccaagc aacatggtca gttttctctt 1500 
gcagtcgtca gcctgaacat aacatccttg ggattacgct ccctcaagga gataagtgat 1560 
ggagatgtga taatttcagg aaacaaaaat ttgtgctatg caaatacaat aaactggaaa 1620 
aaactgtttg ggacctccgg tcagaaaacc aaaattataa gcaacagagg tgaaaacagc 1680 
tgcaaggcca caggccaggt ctgccatgcc ttgtgctccc ccgagggctg ctggggcccg 1740 
gagcccaggg actgcgtctc ttgccggaat gtcagccgag gcagggaatg cgtggacaag 1800 
tgcaagcttc tggagggtga qccaagggag tttgtggaga actctgagtg catacagtgc 1860 
cacccagagt gcctgcctca ggccatgaac atcacctgca caggacgggg accagacaac 1920 
tgtatccagt gtgcccacta cattgacggc ccccactgcg tcaagacctg cccggcagga 1980 
gtcatgggag aaaacaacac cctggtctgg aagtacgcag acgccggcca tgtgtgccac 2040 
ctgtgccatc caaactgcac ctacggatgc actgggccag gtcttgaagg ctgtccaacg 2100 
aatgggccta agatcccgtc catcgccact gggatggtgg gggccctcct cttgctgctg 2160 
gtggtggccc tggggatcgg cctcttcatg cgaaggcgcc acatcgttcg gaagcgcacg 2220 
ctgcggaggc tgctgcagga qagggagctt gtggagcctc ttacacccag tggagaagct 2280 
cccaaccaag ctctcttgag gatcttgaag gaaactgaat tcaaaaagat caaagtgctg 2340 
ggctccggtg cgttcggcac qgtgtataag ggactctgga tcccagaagg tgagaaagtt 2400 
aaaattcccg tcgctatcaa ggaattaaga gaagcaacat ctccgaaagc caacaaggaa 2460 
atcctcgatg aagcctacgt gatggccagc gtggacaacc cccacgtgtg ccgcctgctg 2520 
ggcatctgcc tcacctccac cgtgcaactc atcacgcagc tcatgccctt cggctgcctc 2580 
ctggactatg tccgggaaca caaagacaat attggctccc agtacctgct caactggtgt 2640 
gtgcagatcg caaagggcat gaactacttg gaggaccgtc gcttggtgca ccgcgacctg 2700 
gcagccagga acgtactggt gaaaacaccg cagcatgtca agatcacaga ttttgggctg 2760 
gccaaactgc tgggtgcgga agagaaagaa taccatgcag aaggaggcaa agtgcctatc 2820 
aagtggatgg cattggaatc aattttacac agaatctata cccaccagag tgatgtctgg 2880 
agctacgggg tgaccgtttg ggagttgatg acctttggat ccaagccata tgacggaatc 2940 
cctgccagcg agatctcctc catcctggag aaaggagaac gcctccctca gccacccata 3000 
tgtaccatcg atgtctacat gatcatggtc aagtgctgga tgatagacgc agatagtcgc 3060 
ccaaagttcc gtgagttgat catcgaattc tccaaaatgg cccgagaccc ccagcgctac 3120 
cttgtcattc agggggatga aagaatgcat ttgccaagtc ctacagactc caacttctac 3180 
cgtgccctga tggatgaaga agacatggac gacgtggtgg atgccgacga gtacctcatc 3240 
ccacagcagg gcttcttcag cagcccctcc acgtcacgga ctcccctcct gagctctctg 3300 
agtgcaacca gcaacaattc caccgtggct tgcattgata gaaatgggct gcaaagctgt 3360 
cccatcaagg aagacagctt cttgcagcga tacagctcag accccacagg cgccttgact 3420 
gaggacagca tagacgacac cttcctccca gtgcctgaat acataaacca gtccgttccc 3480 
aaaaggcccg ctggctctgt qcagaatcct gtctatcaca atcagcctct gaaccccgcg 3540 
cccagcagag acccacacta ccaggacccc cacagcactg cagtgggcaa ccccgagtat 3600 
ctcaacactg tccagcccac ctgtgtcaac agcacattcg acagccctgc ccactgggcc 3660 
cagaaaggca gccaccaaat tagcctggac aaccctgact accagcagga cttctttccc 3720 
aaggaagcca agccaaatgg catctttaag ggctccacag ctgaaaatgc agaataccta 3780 
agggtcgcgc cacaaagcag tgaatttatt ggagcatgac cacggaggat agtatgagcc 3840 
ctaaaaatcc agactctttc gatacccagg accaagccac agcaggtcct ccatcccaac 3900 
agccatgccc gcattagctc ttagacccac agactggttt tgcaacgttt acaccgacta 3960 
gccaggaagt acttccacct cgggcacatt ttgggaagtt gcattccttt gtcttcaaac 4020 
tgtgaagcat ttacagaaac gcatccagca agaatattgt ccctttgagc agaaatttat 4080 
ctttcaaaga ggtatatttg aaaaaaaaaa aaaaagtata tgtgaggatt tttattgatt 4140 
ggggatcttg gagtttttca ttqtcgctat tgatttttac ttcaatgggc tcttccaaca 4200 
aggaagaagc ttgctggtag cacttgctac cctgagttca tccaggccca actgtgagca 4260 
aggagcacaa gccacaagtc ttccagagga tgcttgattc cagtggttct gcttcaaggc 4320 
ttccactgca aaacactaaa gatccaagaa ggccttcatg gccccagcag gccggatcgg 4380 
tactgtatca agtcatggca qgtacagtag gataagccac tctgtccctt cctgggcaaa 4440 
gaagaaacgg aggggatgaa ttcttcctta gacttacttt tgtaaaaatg tccccacggt 4500 
acttactccc cactgatgga ccagtggttt ccagtcatga gcgttagact gacttgtttg 4560 
tcttccattc cattgttttg aaactcagta tgccgcccct gtcttgctgt catgaaatca 4620 
gcaagagagg atgacacatc aaataataac tcggattcca gcccacattg gattcatcag 4680 
catttggacc aatagcccac agctgagaat gtggaatacc taaggataac accgcttttg 4740 



WO 03/078662 

PCT/US03/07713 

39740-0001PCT.txt 

ttctcgcaaa aacgtatctc ctaatttgag gctcagatga aatgcatcag gtcctttggg 4800 
gcatagatca gaagactaca aaaatgaagc tgctctgaaa tctcctttag ccatcacccc 4860 
aaccccccaa aattagtttg tgttacttat ggaagatagt tttctccttt tacttcactt 4920 
caaaagcttt ttactcaaag agtatatgtt ccctccaggt cagctgcccc caaaccccct 4980 
ccttacgctt tgtcacacaa aaagtgtctc tgccttgagt catctattca agcacttaca 5040 
gctctggcca caacagggca ttttacaggt gcgaatgaca gtagcattat gagtagtgtg 5100 
aattcaggta gtaaatatga aactagggtt tgaaattgat aatgctttca caacatttgc 5160 
agatgtttta gaaggaaaaa agttccttcc taaaataatt tctctacaat tggaagattg 5220 
gaagattcag ctagttagga gcccattttt tcctaatctg tgtgtgccct gtaacctgac 5280 
tggttaacag cagtcctttg taaacagtgt tttaaactct cctagtcaat atccacccca 5340 
tccaatttat caaggaagaa atggttcaga aaatattttc agcctacagt tatgttcagt 5400 
cacacacaca tacaaaatgt tccttttgct tttaaagtaa tttttgactc ccagatcagt 5460 
cagagcccct acagcattgt taagaaagta tttgattttt gtctcaatga aaataaaact 5520 
atattcattt cc 5532 



<210> 301 
<211> 1528 
<212> DNA 

<213> Homo sapiens 
<400> 301 

cggcgagcga gcaccttcga cgcggtccgg ggaccccctc gtcgctgtcc tcccgacgcg 60 
gacccgcgtg ccccaggcct cgcgctgccc ggccggctcc tcgtgtccca ctcccggcgc 120 
acgccctccc gcgagtcccg ggcccctccc gcgcccctct tctcggcgcg cgcgcagcat 180 
ggcgcccccg caggtcctcg cgttcgggct tctgcttgcc gcggcgacgg cgacttttgc 240 
cgcagctcag gaagaatgtg tctgtgaaaa ctacaagctg gccgtaaact gctttgtgaa 300 
taataatcgt caatgccagt gtacttcagt tggtgcacaa aatactgtca tttgctcaaa 360 
gctggctgcc aaatgtttgg tgatgaaggc agaaatgaat ggctcaaaac ttgggagaag 420 
agcaaaacct gaaggggccc tccagaacaa tgatgggctt tatgatcctg actgcgatga 480 
gagcgggctc tttaaggcca agcagtgcaa cggcacctcc acgtgctggt gtgtgaacac 540 
tgctggggtc agaagaacag acaaggacac tgaaataacc tgctctgagc gagtgagaac 600 
ctactggatc atcattgaac taaaacacaa agcaagagaa aaaccttatg atagtaaaag 660 
tttgcggact gcacttcaga aggagatcac aacgcgttat caactggatc caaaatttat 720 
cacgagtatt ttgtatgaga ataatgttat cactattgat ctggttcaaa attcttctca 780 
aaaaactcag aatgatgtgg acatagctga tgtggcttat tattttgaaa aagatgttaa 840 
aggtgaatcc ttgtttcatt ctaagaaaat ggacctgaca gtaaatgggg aacaactgga 900 
tctggatcct ggtcaaactt taatttatta tgttgatgaa aaagcacctg aattctcaat 960 
gcagggtcta aaagctggtg ttattgctgt tattgtggtt gtggtgatag cagttgttgc 1020 
tggaattgtt gtgctggtta tttccagaaa gaagagaatg gcaaagtatg agaaggctga 1080 
gataaaggag atgggtgaga tgcataggga actcaatgca taactatata atttgaagat 1140 
tatagaagaa gggaaatagc aaatggacac aaattacaaa tgtgtgtgcg tgggacgaag 1200 
acatctttga aggtcatgag tttgttagtt taacatcata tatttgtaat agtgaaacct 1260 
gtactcaaaa tataagcagc ttgaaactgg ctttaccaat cttgaaattt gaccacaagt 1320 
gtcttatata tgcagatcta atgtaaaatc cagaacttgg actccatcgt taaaattatt 1380 
tatgtgtaac attcaaatgt gtgcattaaa tatgcttcca cagtaaaatc tgaaaaactg 1440 
atttgtgatt gaaagctgcc tttctattta cttgagtctt gtacatacat acttttttat 1500 
gagctatgaa ataaaacatt ttaaactg 1528 

<210> 302 

<211> 1856 

<212> DNA 

<213> Homo sapiens 

<400> 302 

ctgacttggc aggactgtgc aattgtcaga aggccgtggg gagtgggggc cagtgcctgc 60 
agcctgccct gcctctctca caggccctta gagcatcgcc aggtgcagag ctccacagct 120 
ctctttccca aggagtaatc agagggtgag aacgtggagc ctggtggaca ggtgaaagca 180 
ctgggatctt tctgcccaga aaggggaaag ttgcacattt atatcctaga gggaagcgac 240 
agcagtgctt ctccctgtgc tgaggtacag gagccatgtg gctagaaatc ctcctcactt 300 
cagtgctggg ctttgccatc tactggttca tctcccggga caaagaggaa actttgccac 360 
ttgaagatgg gtggtggggg ccaggcacga ggtccgcagc cagggaggac gacagcatcc 420 
gccctttcaa ggtggaaacg tcagatgagg agatccacga cttacaccag aggatcgata 480 
agttccgttt caccccacct ttggaggaca gctgcttcca ctatggcttc aactccaact 540 
acctgaagaa agtcatctcc tactggcgga atgaatttga ctggaagaag caggtggaga 600 
ttctcaacag ataccctcac ttcaagacta agattgaagg gctggacatc cacttcatcc 660 
acgtgaagcc cccccagctg cccgcaggcc ataccccgaa gcccttgctg atggtgcacg 720 
gctggcccgg ctctttctac gagttttata agatcatccc actcctgact gaccccaaga 780 
accatggcct gagcgatgag cacgtttttg aagtcatctg cccttccatc cctggctatg 840 
gcttctcaga ggcatcctcc aagaaggggt tcaactcggt ggccaccgcc aggatctttt 900 
acaagctgat gctgcggctg ggcttccagg aattctacat tcaaggaggg gactgggggt 960 
ccctgatctg cactaatatg gcccagctgg tgcccagcca cgtgaaaggc ctgcacttga 1020 
acatggcttt ggttttaagc aacttctcta ccctgaccct cctcctggga cagcgtttcg 1080 
ggaggtttct tggcctcact gagagggatg tggagctgct gtaccccgtc aaggagaagg 1140 
tattctacag cctgatgagg gagagcggct acatgcacat ccagtgcacc aagcctgaca 1200 
ccgtaggctc tgctctgaat gactctcctg tgggtctggc tgcctatatt ctagagaagt 1260 
tttccacctg gaccaatacg gaattccgat acctggagga tggaggcctg gaaaggaagt 1320 
tctccctgga cgacctgctg accaacgtca tgctctactg gacaacaggc accatcatct 1380 
cctcccagcg cttctacaag gagaacctgg gacagggctg gatgacccag aagcatgagc 1440 
ggatgaaggt ctatgtgccc actggcttct ctgccttccc ttttgagcta ttgcacacgc 1500 
ctgaaaagtg ggtgaggttc aagtacccaa agctcatctc ctattcctac atggttcgtg 1560 
ggggccactt tgcggccttt gaggagccgg agctgctcgc ccaggacatc cgcaagttcc 1620 
tgtcggtgct ggagcggcaa tgacccaccc ctctcccccc gcctgccacc tccccccaca 1680 
agtgccctcc aggcttttct tggggaagat accccttttc tgaggaatga gtttgcctcc 1740 
gtcccctgcc catgctggga gcccacgctc accccctcac ccctccaagc tcactcccca 1800 
acccccaact ccgtgtggta agcaacatgg ctttgatgat aaacgacttt actcta 1856 



<210> 303 
<211> 6450 
<212> DNA 

<213> Homo sapiens 



<400> 303 
gagttgtgcc tggagtgatg tttaagccaa tgtcagggca aggcaacagt ccctggccgt 60 
cctccagcac ctttgtaatg catatgagct cgggagacca gtacttaaag ttggaggccc 120 
gggagcccag gagctggcgg agggcgttcg tcctgggagc tgcacttgct ccgtcgggtc 180 
gccggcttca ccggaccgca ggctcccggg gcagggccgg ggccagagct cgcgtgtcgg 240 
cgggacatgc gctgcgtcgc ctctaacctc gggctgtgct ctttttccag gtggcccgcc 300 
ggtttctgag ccttctgccc tgcggggaca cggtctgcac cctgcccgcg gccacggacc 360 
atgaccatga ccctccacac caaagcatct gggatggccc tactgcatca gatccaaggg 420 
aacgagctgg agcccctgaa ccgtccgcag ctcaagatcc ccctggagcg gcccctgggc 480 
gaggtgtacc tggacagcag caagcccgcc gtgtacaact accccgaggg cgccgcctac 540 
gagttcaacg ccgcggccgc cgccaacgcg caggtctacg gtcagaccgg cctcccctac 600 
ggccccgggt ctgaggctgc ggcgttcggc tccaacggcc tggggggttt ccccccactc 660 
aacagcgtgt ctccgagccc gctgatgcta ctgcacccgc cgccgcagct gtcgcctttc 720 
ctgcagcccc acggccagca ggtgccctac tacctggaga acgagcccag cggctacacg 780 
gtgcgcgagg ccggcccgcc ggcattctac aggccaaatt cagataatcg acgccagggt 840 
ggcagagaaa gattggccag taccaatgac aagggaagta tggctatgga atctgccaag 900 
gagactcgct actgtgcagt gtgcaatgac tatgcttcag gctaccatta tggagtctgg 960 
tcctgtgagg gctgcaaggc cttcttcaag agaagtattc aaggacataa cgactatatg 1020 
tgtccagcca ccaaccagtg caccattgat aaaaacagga ggaagagctg ccaggcctgc 1080 
cggctccgca aatgctacga agtgggaatg atgaaaggtg ggatacgaaa agaccgaaga 1140 
ggagggagaa tgttgaaaca caagcgccag agagatgatg gggagggcag gggtgaagtg 1200 
gggtctgctg gagacatgag agctgccaac ctttggccaa gcccgctcat gatcaaacgc 1260 
tctaagaaga acagcctggc cttgtccctg acggccgacc agatggtcag tgccttgttg 1320 
gatgctgagc cccccatact ctattccgag tatgatccta ccagaccctt cagtgaagct 1380 
tcgatgatgg gcttactgac caacctggca gacagggagc tggttcacat gatcaactgg 1440 
gcgaagaggg tgccaggctt tgtggatttg accctccatg atcaggtcca ccttctagaa 1500 
tgtgcctggc tagagatcct gatgattggt ctcgtctggc gctccatgga gcacccagtg 1560 
aagctactgt ttgctcctaa cttgctcttg gacaggaacc agggaaaatg tgtagagggc 1620 
atggtggaga tcttcgacat gctgctggct acatcatctc ggttccgcat gatgaatctg 1680 
cagggagagg agtttgtgtg cctcaaatct attattttgc ttaattctgg agtgtacaca 1740 
tttctgtcca gcaccctgaa gtctctggaa gagaaggacc atatccaccg agtcctggac 1800 
aagatcacag acactttgat ccacctgatg gccaaggcag gcctgaccct gcagcagcag 1860 
caccagcggc tggcccagct cctcctcatc ctctcccaca tcaggcacat gagtaacaaa 1920 
ggcatggagc atctgtacag catgaagtgc aagaacgtgg tgcccctcta tgacctgctg 1980 
ctggagatgc tggacgccca ccgcctacat gcgcccacta gccgtggagg ggcatccgtg 2040 
gaggagacgg accaaagcca cttggccact gcgggctcta cttcatcgca ttccttgcaa 2100 
aagtattaca tcacggggga ggcagagggt ttccctgcca cagtctgaga gctccctggc 2160 
tcccacacgg ttcagataat ccctgctgca ttttaccctc atcatgcacc actttagcca 2220 
aattctgtct cctgcataca ctccggcatg catccaacac caatggcttt ctagatgagt 2280 
ggccattcat ttgcttgctc agttcttagt ggcacatctt ctgtcttctg ttgggaacag 2340 
ccaaagggat tccaaggcta aatctttgta acagctctct ttcccccttg ctatgttact 2400 
aagcgtgagg attcccgtag ctcttcacag ctgaactcag tctatgggtt ggggctcaga 2460 
taactctgtg catttaagct acttgtagag acccaggcct ggagagtaga cattttgcct 2520 
ctgataagca ctttttaaat ggctctaaga ataagccaca gcaaagaatt taaagtggct 2580 
cctttaattg gtgacttgga gaaagctagg tcaagggttt attatagcac cctcttgtat 2640 
tcctatggca atgcatcctt ttatgaaagt ggtacacctt aaagctttta tatgactgta 2700 
gcagagtatc tggtgattgt caattcactt ccccctatag gaatacaagg ggccacacag 2760 
ggaaggcaga tcccctagtt ggccaagact tattttaact tgatacactg cagattcaga 2820 
gtgtcctgaa gctctgcctc tggctttccg gtcatgggtt ccagttaatt catgcctccc 2880 
atggacctat ggagagcaac aagttgatct tagttaagtc tccctatatg agggataagt 2940 
tcctgatttt tgtttttatt tttgtgttac aaaagaaagc cctccctccc tgaacttgca 3000 
gtaaggtcag cttcaggacc tgttccagtg ggcactgtac ttggatcttc ccggcgtgtg 3060 
tgtgccttac acaggggtga actgttcact gtggtgatgc atgatgaggg taaatggtag 3120 
ttgaaaggag caggggccct ggtgttgcat ttagccctgg ggcatggagc tgaacagtac 3180 
ttgtgcagga ttgttgtggc tactagagaa caagagggaa agtagggcag aaactggata 3240 
cagttctgag cacagccaga cttgctcagg tggccctgca caggctgcag ctacctagga 3300 
acattccttg cagaccccgc attgcctttg ggggtgccct gggatccctg gggtagtcca 3360 
gctcttattc atttcccagc gtggccctgg ttggaagaag cagctgtcaa gttgtagaca 3420 
gctgtgttcc tacaattggc ccagcaccct ggggcacggg agaagggtgg ggaccgttgc 3480 
tgtcactact caggctgact ggggcctggt cagattacgt atgcccttgg tggtttagag 3540 
ataatccaaa atcagggttt ggtttgggga agaaaatcct cccccttcct cccccgcccc 3600 
gttccctacc gcctccactc ctgccagctc atttccttca atttcctttg acctataggc 3660 
taaaaaagaa aggctcattc cagccacagg gcagccttcc ctgggccttt gcttctctag 3720 
cacaattatg ggttacttcc tttttcttaa caaaaaagaa tgtttgattt cctctgggtg 3780 
accttattgt ctgtaattga aaccctattg agaggtgatg tctgtgttag ccaatgaccc 3840 
aggtagctgc tcgggcttct cttggtatgt cttgtttgga aaagtggatt tcattcattt 3900 
ctgattgtcc agttaagtga tcaccaaagg actgagaatc tgggagggca aaaaaaaaaa 3960 
aaaaagtttt tatgtgcact taaatttggg gacaatttta tgtatctgtg ttaaggatat 4020 
gcttaagaac ataattcttt tgttgctgtt tgtttaagaa gcaccttagt ttgtttaaga 4080 
agcaccttat atagtataat atatattttt ttgaaattac attgcttgtt tatcagacaa 4140 
ttgaatgtag taattctgtt ctggatttaa tttgactggg ttaacatgca aaaaccaagg 4200 
aaaaatattt agtttttttt tttttttttg tatacttttc aagctacctt gtcatgtata 4260 
cagtcattta tgcctaaagc ctggtgatta ttcatttaaa tgaagatcac atttcatatc 4320 
aacttttgta tccacagtag acaaaatagc actaatccag atgcctattg ttggatattg 4380 
aatgacagac aatcttatgt agcaaagatt atgcctgaaa aggaaaatta ttcagggcag 4440 
ctaattttgc ttttaccaaa atatcagtag taatattttt ggacagtagc taatgggtca 4500 
gtgggttctt tttaatgttt atacttagat tttcttttaa aaaaattaaa ataaaacaaa 4560 
aaaaatttct aggactagac gatgtaatac cagctaaagc caaacaatta tacagtggaa 4620 
ggttttacat tattcatcca atgtgtttct attcatgtta agatactact acatttgaag 4680 
tgggcagaga acatcagatg attgaaatgt tcgcccaggg gtctccagca actttggaaa 4740 
tctctttgta tttttacttg aagtgccact aatggacagc agatattttc tggctgatgt 4800 
tggtattggg tgtaggaaca tgatttaaaa aaaaaactct tgcctctgct ttcccccact 4860 
ctgaggcaag ttaaaatgta aaagatgtga tttatctggg gggctcaggt atggtgggga 4920 
agtggattca ggaatctggg gaatggcaaa tatattaaga agagtattga aagtatttgg 4980 
aggaaaatgg ttaattctgg gtgtgcacca aggttcagta gagtccactt ctgccctgga 5040 
gaccacaaat caactagctc catttacagc catttctaaa atggcagctt cagttctaga 5100 
gaagaaagaa caacatcagc agtaaagtcc atggaatagc tagtggtctg tgtttctttt 5160 
cgccattgcc tagcttgccg taatgattct ataatgccat catgcagcaa ttatgagagg 5220 
ctaggtcatc caaagagaag accctatcaa tgtaggttgc aaaatctaac ccctaaggaa 5280 
gtgcagtctt tgatttgatt tccctagtaa ccttgcagat atgtttaacc aagccatagc 5340 
ccatgccttt tgagggctga acaaataagg gacttactga taatttactt ttgatcacat 5400 
taaggtgttc tcaccttgaa atcttataca ctgaaatggc cattgattta ggccactggc 5460 
ttagagtact ccttcccctg catgacactg attacaaata ctttcctatt catactttcc 5520 
aattatgaga tggactgtgg gtactgggag tgatcactaa caccatagta atgtctaata 5580 
ttcacaggca gatctgcttg gggaagctag ttatgtgaaa ggcaaataaa gtcatacagt 5640 
agctcaaaag gcaaccataa ttctctttgg tgcaagtctt gggagcgtga tctagattac 5700 
actgcaccat tcccaagtta atcccctgaa aacttactct caactggagc aaatgaactt 5760 
tggtcccaaa tatccatctt ttcagtagcg ttaattatgc tctgtttcca actgcatttc 5820 
ctttccaatt gaattaaagt gtggcctcgt ttttagtcat ttaaaattgt tttctaagta 5880 
attgctgcct ctattatggc acttcaattt tgcactgtct tttgagattc aagaaaaatt 5940 
tctattcatt tttttgcatc caattgtgcc tgaactttta aaatatgtaa atgctgccat 6000 
gttccaaacc catcgtcagt gtgtgtgttt agagctgtgc accctagaaa caacatactt 6060 
gtcccatgag caggtgcctg agacacagac ccctttgcat tcacagagag gtcattggtt 6120 
atagagactt gaattaataa gtgacattat gccagtttct gttctctcac aggtgataaa 6180 
caatgctttt tgtgcactac atactcttca gtgtagagct cttgttttat gggaaaaggc 6240 
tcaaatgcca aattgtgttt gatggattaa tatgcccttt tgccgatgca tactattact 6300 
gatgtgactc ggttttgtcg cagctttgct ttgtttaatg aaacacactt gtaaacctct 6360 
tttgcacttt gaaaaagaat ccagcgggat gctcgagcac ctgtaaacaa ttttctcaac 6420 
ctatttgatg ttcaaataaa gaattaaact 6450 

<210> 304 

<211> 3336 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (0)...(0) 
<223> n = A, T, C or G 



<400> 304 
cggcggcgac tgcagtctgg agggtccaca cttgtgattc tcaatggaga gtgaaaacgc 60 
agattcataa tgaaagctag cccccgtcgg ccactgattc tcaaaagacg gaggctgccc 120 
cttcctgttc aaaatgcccc aagtgaaaca tcagaggagg aacctaagag atcccctgcc 180 
caacaggagt ctaatcaagc agaggcctcc aaggaagtgg cggagtccaa ctcttgcaag 240 
tttccagctg ggatcaagat tattaaccac cccaccatgc ccaacacgca agtagtggcc 300 
atccccaaca atgctaatat tcacagcatc atcacagcac tgactgccaa gggaaaagag 360 
agtggcagta gtgggcccaa caaattcatc ctcatcagct gtgggggagc cccaactcag 420 
cctccaggac tccggcctca aacccaaacc agctatgatg ccaaaaggac agaagtgacc 480 
ctggagacct tgggaccaaa acctgcagct agggatgtga atcttcctag accacctgga 540 
gccctttgcg agcagaaacg ggagacctgt gcagatggtg aggcagcagg ctgcactatc 600 
aacaatagcc tatccaacat ccagtggctt cgaaagatga gttctgatgg actgggctcc 660 
cgcagcatca agcaagagat ggaggaaaag gagaattgtc acctggagca gcgacaggtt 720 
aaggttgagg agccttcgag accatcagcg tcctggcaga actctgtgtc tgagcggcca 780 
ccctactctt acatggccat gatacaattc gccatcaaca gcactgagag gaagcgcatg 840 
actttgaaag acatctatac gtggattgag gaccactttc cctactttaa gcacattgcc 900 
aagccaggct ggaagaactc catccgccac aacctttccc tgcacgacat gtttgtccgg 960 
gagacgtctg ccaatggcaa ggtctccttc tggaccattc accccagtgc caaccgctac 1020 
ttgacattgg accaggtgtt taagccactg gacccagggt ctccacaatt gcccgagcac 1080 
ttggaatcac agcagaaacg accgaatcca gagctccgcc ggaacatgac catcaaaacc 1140 
gaactccccc tgggcgcacg gcggaagatg aagccactgc taccacgggt cagctcatac 1200 
ctggtaccta tccagttccc ggtgaaccag tcactggtgt tgcagccctc ggtgaaggtg 1260 
ccattgcccc tggcggcttc cctcatgagc tcagagcttg cccgccatag caagcgagtc 1320 
cgcattgccc ccaaggtgct gctagctgag gaggggatag ctcctctttc ttctgcagga 1380 
ccagggaaag aggagaaact cctgtttgga gaagggtttt ctcctttgct tccagttcag 1440 
actatcaagg aggaagaaat ccagcctggg gaggaaatgc cacacttagc gagacccatc 1500 
aaagtggaga gccctccctt ggaagagtgg ccctccccgg ccccatcttt caaagaggaa 1560 
tcatctcact cctgggagga ttcgtcccaa tctcccaccc caagacccaa gaagtcctac 1620 
agtgggctta ggtccccaac ccggtgtgtc tcggaaatgc ttgtgattca acacagggag 1680 
aggagggaga ggagccggtc tcggaggaaa cagcatctac tgcctccctg tgtggatgag 1740 
ccggagctgc tcttctcaga ggggcccagt acttcccgct gggccgcaga gctcccgttc 1800 
ccagcagact cctctgaccc tgcctcccag ctcagctact cccaggaagt gggaggacct 1860 
tttaagacac ccattaagga aacgctgccc atctcctcca ccccgagcaa atctgtcctc 1920 
cccagaaccc ctgaatcctg gaggctcacg cccccagcca aagtaggggg actggatttc 1980 
agcccagtac aaacctccca gggtgcctct gaccccttgc ctgaccccct ggggctgatg 2040 
gatctcagca ccactccctt gcaaagtgct cccccccttg aatcaccgca aaggctcctc 2100 
agttcagaac ccttagacct catctccgtc ccctttggca actcttctcc ctcagatata 2160 
gacgtcccca agccaggctc cccggagcca caggtttctg gccttgcagc caatcgttct 2220 
ctgacagaag gcctggtcct ggacacaatg aatgacagcc tcagcaagat cctgctggac 2280 
atcagctttc ctggcctgga cgaggaccca ctgggccctg acaacatcaa ctggtcccag 2340 
tttattcctg agctacagta gagccctgcc cttgcccctg tgctcaagct gtccaccatc 2400 
ccgggcactc caaggctcag tgcaccccaa gcctctgagt gaggacagca ggcagggact 2460 
gttctgctcc tcatagctcc ctgctgcctg attatgcaaa agtagcagtc acaccctagc 2520 
cactgctggg accttgtgtt ccccaagagt atctgattcc tctgctgtcc ctgccaggag 2580 
ctgaagggtg ggaacaacaa aggcaatggt gaaaagagat taggaacccc ccagcctgtt 2640 
tccattctct gcccagcagt ctcttacctt ccctgatctt tgcagggtgg tccgtgtaaa 2700 
tagtataaat tctccaaatt atcctctaat tataaatgta agcttatttc cttagatcat 2760 
tatccagaga ctgccagaag gtgggtagga tgacctgggg tttcaattga cttctgttcc 2820 
ttgcttttag ttttgataga agggaagacc tgcagtgcac ggtttcttcc aggctgaggt 2880 
acctggatct tgggttcttc actgcaggga cccagacaag tggatctgct tgccagagtc 2940 
ctttttgccc ctccctgcca cctccccgtg tttccaagtc agctttcctg caagaagaaa 3000 
tcctggttaa aaaagtcttt tgtattgggt caggagttga atttggggtg ggaggatgga 3060 
tgcaactgaa gcagagtgtg ggtgcccaga tgtgcgctat tagatgtttc txtgataatg 3120 
tccccaatca taccagggag actggcattg acgagaactc aggtggaggc ttgagaaggc 3180 
cgaaagggcc cctgacctgc ctggcttcct tagcttgccc ctcagctttg caaagagcca 3240 
ccctaggccc cagctgaccg catgggtgtg agccagcttg agaacactaa ctactcaata 3300 
aaagcgaagg tggaccnaaa aaaaaaaaaa aaaaaa 3336 

<210> 305 
<211> 2365 
<212> DNA 

<213> Homo sapiens 



<400> 305 

tcccagcctt cccatccccc caccgaaagc aaatcattca acgacccccg accctccgac 60 
ggcaggagcc ccccgacctc ccaggcggac cgcccttccc tccccgcgcg ggttccgggc 120 
ccggcgagag ggcgcgacga cagccgaggc catggaggtg acggcggacc agccgcgctg 180 
ggtgagccac caccaccccg ccgtgctcaa cgggcagcac ccggacacgc accacccggg 240 
cctcagccac tcctacatgg acgcggcgca gtacccgctg ccggaggagg tggatgtgct 300 
ttttaacatc gacggtcaag gcaaccacgt cccgccctac tacggaaact cggtcagggc 360 
cacggtgcag aggtaccctc cgacccacca cgggagccag gtgtgccgcc cgcctctgct 420 
tcatggatcc ctaccctggc tggacggcgg caaagccctg ggcagccacc acaccgcctc 480 
cccctggaat ctcagcccct tctccaagac gtccatccac cacggctccc cggggcccct 540 
ctccgtctac cccccggcct cgtcctcctc cttgtcgggg ggccacgcca gcccgcacct 600 
cttcaccttc ccgcccaccc cgccgaagga cgtctccccg gacccatcgc tgtccacccc 660 
aggctcggcc ggctcggccc ggcaggacga gaaagagtgc ctcaagtacc aggtgcccct 720 
gcccgacagc atgaagctgg agtcgtccca ctcccgtggc agcatgaccg ccctgggtgg 780 
agcctcctcg tcgacccacc accccatcac cacctacccg ccctacgtgc ccgagtacag 840 
ctccggactc ttccccccca gcagcctgct gggcggctcc cccaccggct tcggatgcaa 900 
gtccaggccc aaggcccggt ccagcacagg cagggagtgt gtgaactgtg gggcaacctc 960 
gaccccactg tggcggcgag atggcacggg acactacctg tgcaacgcct gcgggctcta 1020 
tcacaaaatg aacggacaga accggcccct cattaagccc aagcgaaggc tgtctgcagc 1080 
caggagagca gggacgtcct gtgcgaactg tcagaccacc acaaccacac tctggaggag 1140 
gaatgccaat ggggaccctg tctgcaatgc ctgtgggctc tactacaagc ttcacaatat 1200 
taacagaccc ctgactatga agaaggaagg catccagacc agaaaccgaa aaatgtctag 1260 
caaatccaaa aagtgcaaaa aagtgcatga ctcactggag gacttcccca agaacagctc 1320 
gtttaacccg gccgccctct ccagacacat gtcctccctg agccacatct cgcccttcag 1380 
ccactccagc cacatgctga ccacgcccac gccgatgcac ccgccatcca gcctgtcctt 1440 
tggaccacac cacccctcca gcatggtcac cgccatgggt tagagccctg ctcgatgctc 1500 
acagggcccc cagcgagagt ccctgcagtc cctttcgact tgcatttttg caggagcagt 1560 
atcatgaagc ctaaacgcga tggatatatg tttttgaagg cagaaagcaa aattatgttt 1620 
gccactttgc aaaggagctc actgtggtgt ctgtgttcca accactgaat ctggacccca 1680 
tctgtgaata agccattctg actcatatcc cctatttaac agggtctcta gtgctgtgaa 1740 
aaaaaaaaat cctgaacatt gcatataact tatattgtaa gaaatactgt acaatgactt 1800 
tattgcatct gggtagctgt aaggcatgaa ggatgccaag aagtttaagg aatatgggag 1860 
aaatagtgtg gaaattaaga agaaactagg tctgatattc aaatggacaa actgccagtt 1920 
ttgtttcctt tcactggcca cagttgtttg atgcattaaa agaaaataaa aaaaagaaaa 1980 
aagagaaaag aaaaaaaaag aaaaaagttg taggcgaatc atttgttcaa agctgttggc 2040 
cctctgcaaa ggaaatacca gttctgggca atcagtgtta ccgttcacca gttgccattg 2100 
agggtttcag agagcctttt tctaggccta catgctttgt gaacaagtcc ctgtaattgt 2160 
tgtttgtatg tataattcaa agcaccaaaa taagaaaaga tgtagattta tttcatcata 2220 
ttatacagac cgaactgttg tataaattta tttactgcta gtcttaagaa ctgctttctt 2280 
tcgtttgttt gtttcaatat tttccttctc tctcaatttt cggttgaata aactagatta 2340 
cattcagttg gcaaaaaaaa aaaaa 2365 



<210> 306 
<211> 1117 
<212> DNA 

<213> Homo sapiens 



<400> 306 

gcaccaacca gcaccatgcc catgatactg gggtactggg acatccgcgg gctggcccac 60 
gccatccgcc tgctcctgga atacacagac tcaagctatg aggaaaagaa gtacacgatg 120 
ggggacgctc ctgattatga cagaagccag tggctgaatg aaaaattcaa gctgggcctg 180 
gactttccca atcngcccta cttgattgat ggggctcaca agatcaccca gagcaacgcc 240 
atcttgtgct acattgcccg caagcacaac ctgtgtgggg agacagaaga ggagaagatt 300 
cgtgtggaca ttttggagaa ccagaccatg gacaaccata tgcagctggg catgatctgc 360 
tacaatccag aatttgagaa actgaagcca aagtacttgg aggaactccc tgaaaagcta 420 
aagctctact cagagtttct ggggaagcgg ccatggtttg caggaaacaa gatcactttt 480 
gtagattttc tcgtctatga tgtccttgac ctccaccgta tatttgagcc caactgcttg 540 
gacgccttcc caaatctgaa ggacttcatc tcccgctttg agggcttgga gaagatctct 600 
gcctacatga agtccagccg cttcctccca agacctgtgt tctcaaagat ggctgtctgg 660 
ggcaacaagt agggccttga aggcaggagg tgggagtgag gagcccatac tcagcctgct 720 
gcccaggctg tgcagcgcag ctggactctg catcccagca cctgcctcct cgttcctttc 780 
tcctgtttat tcccatcttt actcccaaga cttcattgtc cctcttcact ccccctaaac 840 
ccctgtccca tgcaggccct ttgaagcctc agctacccac tatccttcgt gaacatcccc 900 
tcccatcatt acccttccct gcactaaagc cagcctgacc ttccttcctg ttagtggttg 960 
tgtctgcttt aaagcctgcc tggcccctcg cctgtggagc tcagccccga gctgtccccg 1020 
tgttgcatga aggagcagca ttgactggtt tacaggccct gctcctgcag catggtccct 1080 
gcctaggcct acctgatgga agtaaagcct caaccac 1117 



<210> 307 
<211> 1266 
<212> DNA 
<213> Homo sapiens 



<400> 307 

ctcggaagcc cgtcaccatg tcgtgcgagt cgtctatggt tctcgggtac tgggatattc 60 
gtgggctggc gcacgccatc cgcctgctcc tggagttcac ggatacctct tatgaggaga 120 
aacggtacac gtgcggggaa gctcctgact atgatcgaag ccaatggctg gatgtgaaat 180 
tcaagctaga cctggacttt cctaatctgc cctacctcct ggatgggaag aacaagatca 240 
cccagagcaa tgccatcttg cgctacatcg ctcgcaagca caacatgtgt ggtgagactg 300 
aagaagaaaa gattcgagtg gacatcatag agaaccaagt aatggatttc cgcacacaac 360 
tgataaggct ctgttacagc tctgaccacg aaaaactgaa gcctcagtac ttggaagagc 420 
tacctggaca actgaaacaa ttctccatgt ttctgtggaa attctcatgg tttgccgggg 480 
aaaagctcac ctttgtggat tttctcacct atgatatctt ggatcagaac cgtatatttg 540 
accccaagtg cctggatgag ttcccaaacc tgaaggcttt catgtgccgt tttgaggctt 600 
tggagaaaat cgctgcctac ttacagtctg atcagttctg caagatgccc atcaacaaca 660 
agatggccca gtggggcaac aagcctgtat gctgagcagg aggcagactt gcagagcttg 720 
ttttgtttca tcctgtccgt aaggggtcag cgctcttgct ttgctctttt caatgaatag 780 
cacttatgtt actggtgtcc agctgagttt ctcttgggta taaaggctaa aagggaaaaa 840 
ggatatgtgg agaatcatca agatatgaat tgaatcgctg cgatactgtg gcatttccct 900 
actccccaac tgagttcaag ggctgtaggt tcatgcccaa gccctgagag tgggtactag 960 
aaaaaacgag attgcacagt tggagagagc aggtgtgtta aatggactgg agtccctgtg 1020 
aagactgggt gaggataaca caagtaaaac tgtggtactg atggacttaa ccggagttcg 1080 
gaaaccgtcc tgtgtacaca tgggagttta gtgtgataaa ggcagtattt cagactggtg 1140 
ggctagccaa tagagttggc aattgcttat tgaaactcat taaaaataat agagccccac 1200 
ttgacactat tcactaaaat taatctggaa tttaaggccc aacattaaac acaaagctgt 1260 
attgat 1266 

<210> 308 
<211> 2162 
<212> DNA 
<213> Homo sapiens 



<400> 308 
gggctgcgct gtccagctgt ggctatggcc ccagccccga gatgaggagg gagagaacta 60 
ggggcccgca ggcctgggaa tttccgtccc ccaccaagtc cggatgctca ctccaaagtc 120 
tcagcaggcc cctgagggag ggagctgtca gccagggaaa accgagaaca ccatcaccat 180 
gacaaccagt caccagcctc aggacagata caaagctgtc tggcttatct tcttcatgct 240 
gggtctggga acgctgctcc cgtggaattt tttcatgacg gccactcagt atttcacaaa 300 
ccgcctggac atgtcccaga atgtgtcctt ggtcactgct gaactgagca aggacgccca 360 
ggcgtcagcc gcccctgcag cacccttgcc tgagcggaac tctctcagtg ccatcttcaa 420 
caatgtcatg accctatgtg ccatgctgcc cctgctgtta ttcacctacc tcaactcctt 480 
cctgcatcag aggatccccc agtccgtacg gatcctgggc agcctggtgg ccatcctgct 540 
ggtgtttctg atcactgcca tcctggtgaa ggtgcagctg gatgctctgc ccttctttgt 600 
catcaccatg atcaagatcg tgctcattaa ttcatttggt gccatcctgc agggcagcct 660 
gtttggtctg gctggccttc tgcctgccag ctacacggcc cccatcatga gtggccaggg 720 
cctagcaggc ttctttgcct ccgtggccat gatctgcgct attgccagtg gctcggaact 780 
atcagaaagt gccttcggct actttatcac agcctgtgct gttatcattt tgaccatcat 840 
ctgttacctg ggcctgcccc gcctggaatt ctaccgctac taccagcagc tcaagcttga 900 
aggacccggg gagcaggaga ccaagttgga cctcattagc aaaggagagg agccaagagc 960 
aggcaaagag gaatctggag tttcagtctc caactctcag cccaccaatg aaagccactc 1020 
tatcaaagcc atcctgaaaa atatctcagt cctggctttc nctgtctgct tcatcttcac 1080 
tatcaccatt gggatgtttc cagccgtgac tgttgaggtc aagtccagca tcgcaggcag 1140 
cagcacctgg gaacgttact tcattcctgt gtcctgtttc ttgactttca atatctttga 1200 
ctggttgggc cggagcctca cagctgtatt catgtggcct gggaaggaca gccgctggct 1260 
gccaagcctg gtgctggccc ggctggtgtt tgtgccactg ctgctgctgt gcaacattaa 1320 
gccccgccgc tacctgactg tggtcttcga gcacgatgcc tggttcatct tcttcatggc 1380 
tgcctttgcc ttctccaacg gctacctcgc cagcctctgc atgtgcttcg ggcccaagaa 1440 
agtgaagcca gctgaggcag agaccgcagg agccatcatg gccttcttcc tgtgtctggg 1500 
tctggcactg ggggctgttt tctccttcct gttccgggca attgtgtgac aaaggatgga 1560 
cagaaggact gcctgcctcc ctccctgtct gcctcctgcc ccttccttct gccaggggtg 1620 
atcctgagtg gtctggcggt tttttcttct aactgacttc tgctttccac ggcgtgtgct 1680 
gggcccggat ctccaggccc tggggaggga gcctctggac ggacagtggg gacattgtgg 1740 
gtttggggct cagagtcgag ggacggggtg tagcctcggc atttgcttga guttictccac 1800 
tcttggctct gactgatccc tgcttgtgca ggccagtgga ggctcttggg cttggagaac 1860 
acgtgtgtct ctgtgtatgt gtctgtgtgt ctgcgtccgt gtctgtcaga ctgtctgcct 1920 
gtcctggggt ggctaggagc tgggtctgac cgttgtatgg tttgacctga tatactccat 1980 
tctcccctgc gcctcctcct ctgtgttttt tccatgtccc cctcccaact ccccatgccc 2040 
agtttttacc catcatgcac cctgtacagt tgccacgtta ctgccttttt taaaaatata 2100 
tttgacagaa accaggtgcc ttcagaggct ctctgattta aataaacctt tcttgttttt 2160 
tt 2162 



<210> 309 

<211> 3933 

<212> DNA 

<213> Homo sapiens 



<400> 309 

cacgaggcag cactctcttc gtcgcttcgg ccagtgtgtc gggctgggcc ctgacaagcc 60 
acctgaggag aggctcggag ccgggcccgg accccggcga ttgccgcccg cttctctcta 120 
gtctcacgag gggtttcccg cctcgcaccc ccacctctgg acttgccttt ccttctcttc 180 
tccgcgtgtg gagggagcca gcgcttaggc cggagcgagc ctgggggccg cccgccgtga 240 
agacatcgcg gggaccgatt caccatggag ggcgccggcg gcgcgaacga caagaaaaag 300 
ataagttctg aacgtcgaaa agaaaagtct cgagatgcag ccagatctcg gcgaagtaaa 360 
gaatctgaag ttttttatga gcttgctcat cagttgccac ttccacataa tgtgagttcg 420 
catcttgata aggcctctgt gatgaggctt accatcagct atttgcgtgt gaggaaactt 480 
ctggatgctg gtgatttgga tattgaagat gacatgaaag cacagatgaa ttgcttttat 540 
ttgaaagcct tggatggttt tgttatggtt ctcacagatg atggtgacat gatttacatt 600 
tctgataatg tgaacaaata catgggatta actcagtttg aactaactgg acacagtgtg 660 
tttgatttta ctcatccatg tgaccatgag gaaatgagag aaatgcttac acacagaaat 720 
ggccttgtga aaaagggtaa agaacaaaac acacagcgaa gcttttttct cagaatgaag 780 
tgtaccctaa ctagccgagg aagaactatg aacataaagt ctgcaacatg gaaggtattg 840 
cactgcacag gccacattca cgtatatgat accaacagta accaacctca gtgtgggtat 900 
aagaaaccac ctatgacctg cttggtgctg atttgtgaac ccattcctca cccatcaaat 960 
attgaaattc ctttagatag caagactttc ctcagtcgac acagcctgga tatgaaattt 1020 
tcttattgtg atgaaagaat taccgaattg atgggatatg agccagaaga acttttaggc 1080 
cgctcaattt atgaatatta tcatgctttg gactctgatc atctgaccaa aactcatcat 1140 
gatatgttta ctaaaggaca agtcaccaca ggacagtaca ggatgcttgc caaaagaggt 1200 
ggatatgtct gggttgaaac tcaagcaact gtcatatata acaccaagaa ttctcaacca 1260 
cagtgcattg tatgtgtgaa ttacgttgtg agtggtatta ttcagcacga cttgattttc 1320 
tcccttcaac aaacagaatg tgtccttaaa ccggttgaat cttcagatat gaaaatgact 1380 
cagctattca ccaaagttga atcagaagat acaagtagcc tctttgacaa acttaagaag 1440 
gaacctgatg ctttaacttt gctggcccca gccgctggag acacaatcat atctttagat 1500 
tttggcagca acgacacaga aactgatgac cagcaacttg aggaagtacc attatataat 1560 
gatgtaatgc tcccctcacc caacgaaaaa ttacagaata taaatttggc aatgtctcca 1620 
ttacccaccg ctgaaacgcc aaagccactt cgaagtagtg ctgaccctgc actcaatcaa 1680 
gaagttgcat taaaattaga accaaatcca gagtcactgg aactttcttt taccatgccc 1740 
cagattcagg atcagacacc tagtccttcc gatggaagca ctagacaaag ttcacctgag 1800 
cctaatagtc ccagtgaata ttgtttttat gtggatagtg atatggtcaa tgaattcaag 1860 
ttggaattgg tagaaaaact ttttgctgaa gacacagaag caaagaaccc attttctact 1920 
caggacacag atttagactt ggagatgtta gctccctata tcccaatgga tgatgacttc 1980 
cagttacgtt ccttcgatca gttgtcacca ttagaaagca gttccgcaag ccctgaaagc 2040 
gcaagtcctc aaagcacagt tacagtattc cagcagactc aaatacaaga acctactgct 2100 
aatgccacca ctaccactgc caccactgat gaattaaaaa cagtgacaaa agaccgtatg 2160 
gaagacatta aaatattgat tgcatctcca tctcctaccc acatacataa agaaactact 2220 
agtgccacat catcaccata tagagatact caaagtcgga cagcctcacc aaacagagca 2280 
ggaaaaggag tcatagaaca gacagaaaaa tctcatccaa gaagccctaa cgtgttatct 2340 
gtcgctttga gtcaaagaac tacagttcct gaggaagaac taaatccaaa gatactagct 2400 
ttgcagaatg ctcagagaaa gcgaaaaatg gaacatgatg gttcactttt tcaagcagta 2460 
ggaattggaa cattattaca gcagccagac gatcatgcag ctactacatc actttcttgg 2520 
aaacgtgtaa aaggatgcaa atctagtgaa cagaatggaa tggagcaaaa gacaattatt 2580 
ttaataccct ctgatttagc atgtagactg ctggggcaat caatggatga aagtggatta 2640 
ccacagctga ccagttatga ttgtgaagtt aatgctccta tacaaggcag cagaaaccta 2700 
ctgcagggtg aagaattact cagagctttg gatcaagtta actgagcttt ttcttaattt 2760 
cattcctttt tttggacact ggtggctcac tacctaaagc agtctattta tattttctac 2820 
atctaatttt agaagcctgg ctacaatact gcacaaactt ggttagttca atttttgatc 2880 
ccctttctac ttaatttaca ttaatgctct tttttagtat gttctttaat gctggatcac 2940 
agacagctca ttttctcagt tttttggtat ttaaaccatt gcattgcagt agcatcattt 3000 
taaaaaatgc acctttttat ttatttattt ttggctaggg agtttatccc tttttcgaat 3060 
tatttttaag aagatgccaa tataattttt gtaagaaggc agtaaccttt catcatgatc 3120 
ataggcagtt gaaaaatttt tacacctttt ttttcacatt ttacataaat aataatgctt 3180 
tgccagcagt acgtggtagc cacaattgca caatatattt tcttaaaaaa taccagcagt 3240 
tactcatgga atatattctg cgtttataaa actagttttt aagaagaaat tttttttggc 3300 
ctatgaaatt gttaaacctg gaacatgaca ttgttaatca tataataatg attcttaaat 3360 
gctgtatggt ttattattta aatgggtaaa gccatttaca taatatagaa agatatgcat 3420 
atatctagaa ggtatgtggc atttatttgg ataaaattct caattcagag aaatcatctg 3480 
atgtttctat agtcactttg ccagctcaaa agaaaacaat accctatgta gttgtggaag 3540 
tutatgctaa tattgtgtaa ctgatattaa acctaaatgt tctgcctacc ctgttggtat 3600 
aaagatattt tgagcagact gtaaacaaga aaaaaaaaat catgcattct tagcaaaatt 3660 
gcctagtatg ttaatttgct caaaatacaa tgtttgattt tatgcacttt 9*cgctatta 3720 
acatcctttt tttcatgtag atttcaataa ttgagtaatt ttagaagcat tattttagga 3780 
atatatagtt gtcacagtaa atatcttgtt ttttctatgt acantgtaca aatttttcat 3840 
tccttttgct ctttgtggtt ggatctaaca ctaactgtat tgttttgtta catcaaataa 3900 
acatcttctg tggaaaaaaa aaaaaaaaaa aaa 3933 



<210> 310 

<211> 2872 

<212> DNA 

<213> Homo sapiens 



<400> 310 
tccaggaatc gatagtgcat tcgtgcgcgc ggccgcccgt cgcttcgcac agggctggat 60 
ggttgtattg ggcagggtgg ctccaggatg ttaggaactg tgaagatgga agggcatgaa 120 
accagcgact ggaacagcta ctacgcagac acgcaggagg cctactcctc ggtcccggtc 180 
agcaacatga actcaggcct gggctccatg aactccatga acacctacat gaccatgaac 240 
accatgacta cgagcggcaa catgaccccg gcgtccttca acatgtccta tgccaacccg 300 
gccttagggg ccggcctgag tcccggcgca gtagccggca tgccgggggg ctcggcgggc 360 
gccatgaaca gcatgactgc ggccggcgtg acggccatgg gtacggcgct gagcccgagc 420 
ggcatgggcg ccatgggtgc gcagcaggcg gcctccatga tgaatggcct gggcccctac 480 
gcggccgcca tgaacccgtg catgagcccc atggcgtacg cgccgtccaa cctgggccgc 540 
agccgcgcgg gcggcggcgg cgacgccaag acgttcaagc gcagttaccc gcacgccaag 600 
ccgccctact cgtacatctc gctcatcacc atggccatcc agcgggcgcc cagcaagatg 660 
ctcacgctga gcgagatcta ccagtggatc atggacctct tcccctatta ccggcagaac 720 
cagcagcgct ggcagaactc catccgccac tcgctgtcct tcaatgactg cttcgtcaag 780 
gtggcacgct ccccggacaa gccgggcaag ggctcctact ggacgctgca cccggactcc 840 
ggcaacatgt tcgagaacgg ctgctacttg cgccgccaga agcgcttcaa gtgcgagaag 900 
cagccggggg ccggcggcgg gggcgggagc ggaagcgggg gcagcggcgc caagggcggc 960 
cctgagagcc gcaaggaccc ctctggcgcc tctaacccca gcgccgactc gcccctccat 1020 
cggggtgtgc acgggaagac cggccagcta gagggcgcgc cggccccggg cccggccgcc 1080 
agcccccaga ctctggacca cagtggggcg acggcgacag ggggcgcctc ggagttgaag 1140 
actccagcct cctcaactgc gccccccata agctccgggc ccggggcgct ggcctctgtg 1200 
cccgcctctc acccggcaca cggcttggca ccccacgagt cccagctgca cctgaaaggg 1260 
gacccccact actccttcaa ccacccgttc tccatcaaca acctcatgtc ctcctcggag 1320 
cagcagcata agctggactt caaggcatac gaacaggcac tgcaatactc gccttacggc 1380 
tctacgttgc ccgccagcct gcctctaggc agcgcctcgg tgaccaccag gagccccatc 1440 
gagccctcag ccctggagcc ggcgtactac caaggtgtgt attccagacc cgtcctaaac 1500 
acttcctagc tcccgggact ggggggtttg tctggcatag ccatgctggt agcaagagag 1560 
aaaaaatcaa cagcaaacaa aaccacacaa accaaaccgt caacagcata ataaaatcca 1620 
acaactattt ttatttcatt tttcatgcac aaccttgccc ccagtgcaaa agactgttac 1680 
tttattattg tattcaaaat tcattgtgta tattactaca aagacggccc caaaccaatt 1740 
tttttcctgc gaagtttaat gatccacaag tgtatatatig aaattctcct ccttccttgc 1800 
ccccctctct ttcttccctc ttggccctcc agacattcta gtttgtggag ggttatttaa 1860 
aaaacaaaaa ggaagatggt caagtttgta aaatatttgt ttgtgctttt cccccctcct 1920 
tacctgaccc cctacgagtt tacaggcttg tggcaatact cttaaccata agaattgaaa 1980 
tggtgaagaa acaagtatac actagaggct cttaaaagta ttgaaaagac aatactgctg 2040 
ttatatagca agacataaac agattataaa catcagagcc atttgcttct cagtttacat 2100 
ttctgataca tgcagatagc agatgtcttt aaatgaaata catgtatatt gtgtatggac 2160 
ttaattatgc acatgctcag atgtgtagac atcctccgta tatttacata acatatagag 2220 
gtaatagata ggtgatatac gtgatacgtt ctcaagagtt gcttgaccga aagttacaag 2280 
gaccccaacc cctttgctct ctacccacag atggccctgg gaacaatcct caggaattgc 2340 
cctcaagaac tcgcttcttt gctttgagag tgccatggtc atgtcattct gaggtacata 2400 
acacataaat tagtttctat gagtgtatac catttaaaga ttttttcagt aaagggaata 2460 
ttacatgttg ggaggaggag ataagttata gggagctgga tttcaaacgg tggtccaaga 2520 
ttcaaaaatc ctattgatag tggccatttt aatcattgcc atcgtgtgct tgtttcatcc 2580 
agtgttatgc actttccaca gttggtgtta gtatagccag agggtttcat tattatttct 2640 
ctttgctttc tcaatgttaa tttattgcat ggtttattct ttttctttac agctgaaatt 2700 
gctttaaatg atggttaaaa ttacaaatta aattgggaat ttttatcaat gtgattgtaa 2760 
ttaaaaatat tttgatttaa ataacaaaaa taataccaga ttttaagccg cggaaaatgt 2820 
tcttgatcat ttgcagttaa ggactttaaa taaatcaaat gttaacaaaa aa 2872 

<210> 311 
<211> 926 
<212> DNA 

<213> Homo sapiens 
<400> 311 

ggggcccatt ctgtttcagc cagtcgccaa gaatcatgaa agtcgccagt ggcagcaccg 60 

ccaccgccgc cgcgggcccc agctgcgcgc tgaaggccgg caagacagcg agcggtgcgg 120 

gcgaggtggt gcgctgtctg tctgagcaga gcgtggccat ctcgcgctgc cggggcgccg 180 

gggcgcgcct gcctgccctg ctggacgagc agcaggtaaa cgtgctgctc tacgacatga 240 

acggctgtta ctcacgcctc aaggagctgg tgcccaccct gccccagaac cgcaaggtga 300 

gcaaggtgga gattctccag cacgtcatcg actacatcag ggaccttcag ttggagctga 360 

actcggaatc cgaagttggg acccccgggg gccgagggct gccggtccgg gctccgctca 420 

gcaccctcaa cggcgagatc agcgccctga cggccgaggc ggcatgcgtt cctgcggacg 480 

atcgcatctt gtgtcgctga agcgcctccc ccagggaccg gcggacccca gccatccagg 540 

gggcaagagg aattacgtgc tctgtgggtc tcccccaacg cgcctcgccg gatctgaggg 600 

agaacaagac cgatcggcgg ccactgcgcc cttaactgca tccagcctgg ggctgaggct 660 

gaggcactgg cgaggagagg gcgctcctct ctgcacacct actagtcacc agagacttta 720 

gggggtggga ttccactcgt gtgtttctat tttttgaaaa gcagacattt taaaaaatgg 780 

tcacgtttgg tgcttctcag atttctgagg aaattgcttt gtattgtata ttacaatgat 840 

caccgactga gaatattgtt ttacaatagt tctgtggggc tgtttttttg ttattaaaca 900 

aataatttag atggtgaaaa aaaaaa 926 

<210> 312 
<211> 4989 
<212> DNA 

<213> Homo sapiens 
<400> 312 

tttttttttt ttttgagaaa gggaatttca tcccaaataa aaggaatgaa gtctggctcc 60 

ggaggagggt ccccgacctc gctgtggggg ctcctgtttc tctccgccgc gctctcgctc 120 

tggccgacga gtggagaaat ctgcgggcca ggcatcgaca tccgcaacga ctatcagcag 180 

ctgaagcgcc tggagaactg cacggtgatc gagggctacc tccacatcct gctcatctcc 240 

aaggccgagg actaccgcag ctaccgcttc cccaagctca cggtcattac cgagtacttg 300 

ctgctgttcc gagtggctgg cctcgagagc ctcggagacc tcttccccaa cctcacggtc 360 

atccgcggct ggaaactctt ctacaactac gccctggtca tcttcgagat gaccaatctc 420 

aaggatattg ggctttacaa cctgaggaac attactcggg gggccatcag gattgagaaa 480 

aatgctgacc tctgttacct ctccactgtg gactggtccc tgatcctgga tgcggtgtcc 540 

aataactaca ttgtggggaa taagccccca aaggaatgtg gggacctgtg tccagggacc 600 

atggaggaga agccgatgtg tgagaagacc accatcaaca atgagtacaa ctaccgctgc 660 

tggaccacaa accgctgcca gaaaatgtgc ccaagcacgt gtgggaagcg ggcgtgcacc 720 

gagaacaatg agtgctgcca ccccgagtgc ctgggcagct gcagcgcgcc tgacaacgac 780 

acggcctgtg tagcttgccg ccactactac tatgccggtg tctgtgtgcc tgcctgcccg 840 

cccaacacct acaggtttga gggctggcgc tgtgtggacc gtgacttctg cgccaacatc 900 

ctcagcgccg agagcagcga ctccgagggg tttgtgatcc acgacggcga gtgcatgcag 960 

gagtgcccct cgggcttcat ccgcaacggc agccagagca tgtactgcat cccttgtgaa 1020 

ggtccttgcc cgaaggtctg tgaggaagaa aagaaaacaa agaccattga ttctgttact 1080 

tctgctcaga tgctccaagg atgcaccatc ttcaagggca atttgctcat taacatccga 1140 

cgggggaata acattgcttc agagctggag aacttcatgg ggctcatcga ggtggtgacg 1200 

ggctacgtga agatccgcca ttctcatgcc ttggtctcct tgtccttcct aaaaaacctt 1260 

cgcctcatcc taggagagga gcagctagaa gggaattact ccttctacgt cctcgacaac 1320 

cagaacttgc agcaactgtg ggactgggac caccgcaacc tgaccatcaa agcagggaaa 1380 

atgtactttg ctttcaatcc caaattatgt gtttccgaaa tttaccgcat ggaggaagtg 1440 

acggggacta aagggcgcca aagcaaaggg gacataaaca ccaggaacaa cggggagaga 1500 

gcctcctgtg aaagtgacgt cctgcatttc acctccacca ccacgtcgaa gaatcgcatc 1560 

atcataacct ggcaccggta ccggccccct gactacaggg atctcatcag cttcaccgtt 1620 

tactacaagg aagcaccctt taagaatgtc acagagtatg atgggcagga tgcctgcggc 1680 

tccaacagct ggaacatggt ggacgtggac ctcccgccca acaaggacgt ggagcccggc 1740 

atcttactac atgggctgaa gccctggact cagtacgccg tttacgtcaa ggctgtgacc 1800 
ctcaccatgg tggagaacga ccatatccgt ggggccaaga gtgagatctt gtacattcgc 1860 

accaatgctt cagttccttc cattcccttg gacgttcttt cagcatcgaa ctcctcttct 1920 

cagttaatcg tgaagtggaa ccctccctct ctgcccaacg gcaacctgag ttactacatt 1980 

gtgcgctggc agcggcagcc tcaggacggc tacctttacc ggcacaatta ctgctccaaa 2040 

gacaaaatcc ccatcaggaa gtatgccgac ggcaccatcg acattgagga ggtcacagag 2100 

aaccccaaga ctgaggtgtg tggtggggag aaagggcctt gctgcgcctg ccccaaaact 2160 
gaagccgaga agcaggccga gaaggaggag gctgaatacc gcaaagtctt tgagaatttc 2220 
ctgcacaact ccatcttcgt gcccagacct gaaaggaagc ggagagatgt catgcaagtg 2280 
gccaacacca ccatgtccag ccgaagcagg aacaccacgg ccgcagacac ctacaacatc 2340 
accgacccgg aagagctgga gacagagtac cctttctttg agagcagagt ggataacaag 2400 
gagagaactg tcatttctaa ccttcggcct ttcacattgt accgcatcga tatccacagc 2460 
tgcaaccacg aggctgagaa gctgggctgc agcgcctcca acttcgtctt tgcaaggact 2520 
atgcccgcag aaggagcaga tgacattcct gggccagtga cctgggagcc aaggcctgaa 2580 
aactccatct ttttaaagtg gccggaacct gagaatccca atggattgat tctaatgtat 2640 
gaaataaaat acggatcaca agttgaggat cagcgagaat gtgtgtccag acaggaatac 2700 
aggaagtatg gaggggccaa gctaaaccgg ctaaacccgg ggaactacac agcccggatt 2760 
caggccacat ctctctctgg gaatgggtcg tggacagatc ctgtgttctt ctatgtccag 2820 
gccaaaacag gatatgaaaa cttcatccat ctgatcatcg ctctgcccgt cgctgtcctg 2880 
ttgatcgtgg gagggttggt gattatgctg tacgtcttcc atagaaagag aaataacagc 2940 
aggctgggga atggagtgct gtatgcctct gtgaacccgg agtacttcag cgctgctgat 3000 
gtgtacgttc ctgatgagtg ggaggtggct cgggagaaga tcaccatgag ccgggaactt 3060 
gggcaggggt cgtttgggat ggtctatgaa ggagttgcca agggtgtggt gaaagatgaa 3120 
cctgaaacca gagtggccat taaaacagtg aacgaggccg caagcatgcg tgagaggatt 3180 
gagtttctca acgaagcttc tgtgatgaag gagttcaatt gtcaccatgt ggtgcgattg 3240 
ctgggtgtgg tgtcccaagg ccagccaaca ctggtcatca tggaactgat gacacggggc 3300 
gatctcaaaa gttatctccg gtctctgagg ccagaaatgg agaataatcc agtcctagca 3360 
cctccaagcc tgagcaagat gattcagatg gccggagaga ttgcagacgg catggcatac 3420 
ctcaacgcca ataagttcgt ccacagagac cttgctgccc ggaattgcat ggtagccgaa 3480 
gatttcacag tcaaaatcgg agattttggt atgacgcgag atatctatga gacagactat 3540 
taccggaaag gaggcaaagg gctgctgccc gtgcgctgga tgtctcctga gtccctcaag 3600 
gatggagtct tcaccactta ctcggacgtc tggtccttcg gggtcgtcct ctgggagatc 3660 
gccacactgg ccgagcagcc ctaccagggc ttgtccaacg agcaagtcct tcgcttcgtc 3720 
atggagggcg gccttctgga caagccagac aactgtcctg acatgctgtt tgaactgatg 3780 
cgcatgtgct ggcagtataa ccccaagatg aggccttcct tcctggagat catcagcagc 3840 
atcaaagagg agatggagcc tggcttccgg gaggtctcct tctactacag cgaggagaac 3900 
aagctgcccg agccggagga gctggacctg gagccagaga acatggagag cgtccccctg 3960 
gacccctcgg cctcctcgtc ctccctgcca ctgcccgaca gacactcagg acacaaggcc 4020 
gagaacggcc ccggccctgg ggtgctggtc ctccgcgcca gcttcgacga gagacagcct 4080 
tacgcccaca tgaacggggg ccgcaagaac gagcgggcct tgccgctgcc ccagtcttcg 4140 
acctgctgat ccttggatcc tgaatctgtg caaacagtaa cgtgtgcgca cgcgcagcgg 4200 
ggtggggggg gagagagagt tttaacaatc cattcacaag cctcctgtac ctcagtggat 4260 
cttcagttct gcccttgctg cccgcgggag acagcttctc tgcagtaaaa cacatttggg 4320 
atgttccttt tttcaatatg caagcagctt tttattccct gcccaaaccc ttaactgaca 4380 
tgggccttta agaaccttaa tgacaacact taatagcaac agagcacttg agaaccagtc 4440 
tcctcactct gtccctgtcc ttccctgttc tccctttctc tctcctctct gcttcataac 4500 
ggaaaaataa ttgccacaag tccagctggg aagccctttt tatcagtttg aggaagtggc 4560 
tgtccctgtg gccccatcca accactgtac acacccgcct gacaccgtgg gtcattacaa 4620 
aaaaacacgt ggagatggaa atttttacct ttatctttca cctttctagg gacatgaaat 4680 
ttacaaaggg ccatcgttca tccaaggctg ttaccatttt aacgctgcct aattttgcca 4740 
aaatcctgaa ctttctccct catcggcccg gcgctgattc ctcgtgtccg gaggcatggg 4800 
tgagcatggc agctggttgc tccatttgag agacacgctg gcgacacact ccgtccatcc 4860 
gactgcccct gctgtgctgc tcaaggccac aggcacacag gtctcattgc ttctgactag 4920 
attattattt gggggaactg gacacaatag gtctttctct cagtgaaggt ggggagaagc 4980 
tgaaccggc 4989 



<210> 313 
<211> 12515 
<212> DNA 

<213> Homo sapiens 



<400> 313 

ctaccgggcg gaggtgagcg cggcgccggc tcctcctgcg gcggactttg ggtgcgactt 60 
gacgagcggt ggttcgacaa gtggccttgc gggccggatc gtcccagtgg aagagttgta 120 
aatttgcttc tggccttccc ctacggatta tacctggcct tcccctacgg attatactca 180 
acttactgtt tagaaaatgt ggcccacgag acgcctggtt actatcaaaa ggagcggggt 240 
cgacggtccc cactttcccc tgagcctcag cacctgcttg tttggaaggg gtattgaatg 300 
tgacatccgt atccagcttc ctgttgtgtc aaaacaacat tgcaaaattg aaatccatga 360 
gcaggaggca atattacata atttcagttc cacaaatcca acacaagtaa atgggtctgt 420 
tattgatgag cctgtacggc taaaacatgg agatgtaata actattattg atcgttcctt 480 
caggtatgaa aatgaaagtc ttcagaatgg aaggaagtca actgaatttc caagaaaaat 540 
acgtgaacag gagccagcac gtcgtgtctc aagatctagc ttctcttctg accctgatga 600 
gaaagctcaa gattccaagg cctattcaaa aatcactgaa ggaaaagttt caggaaatcc 660 
tcaggtacat atcaagaatg tcaaagaaga cagtaccgca gatgactcaa aagacagtgt 720 
tgctcaggga acaactaatg ttcattcctc agaacatgct ggacgtaatg gcagaaatgc 780 






agctgatccc atttctgggg attttaaaga aatttccagc gttaaattag tgagccgtta 840 
tggagaattg aagtctgttc ccactacaca atgtcttgac aatagcaaaa aaaatgaatc 900 
tcccttttgg aagctttatg agtcagtgaa gaaagagttg gatgtaaaat cacaaaaaga 960 
aaatgtccta cagtattgta gaaaatctgg attacaaact gattacgcaa cagagaaaga 1020 
aagtgctgat ggtttacagg gggagaccca actgttggtc tcgcgtaagt caagaccaaa 1080 
atctggtggg agcggccacg ctgtggcaga gcctgcttca cctgaacaag agcttgacca 1140 
gaacaagggg aagggaagag acgtggagtc tgttcagact cccagcaagg ctgtgggcgc 1200 
cagctttcct ctctatgagc cggctaaaat gaagacccct gtacaatatt cacagcaaca 1260 
aaattctcca caaaaacata agaacaaaga cctgtatact actggtagaa gagaatctgt 1320 
gaatctgggt aaaagtgaag gcttcaaggc tggtgataaa actcttactc ccaggaagct 1380 
ttcaactaga aatcgaacac cagctaaagt tgaagatgca gctgactctg ccactaagcc 1440 
agaaaatctc tcttccaaaa ccagaggaag tattcctaca gatgtggaag ttctgcctac 1500 
ggaaactgaa attcacaatg agccattttt aactctgtgg ctcactcaag ttgagaggaa 1560 
gatccaaaag gattccctca gcaagcctga gaaattgggc actacagctg gacagatgtg 1620 
ctctgggtta cctggtctta gttcagttga tatcaacaac tttggtgatt ccattaatga 1680 
gagtgaggga atacctttga aaagaaggcg tgtgtccttt ggtgggcacc taagacctga 1740 
actatttgat gaaaacttgc ctcctaatac gcctctcaaa aggggagaag ccccaaccaa 1800 
aagaaagtct ctggtaatgc acactccacc tgtcctgaag aaaatcatca aggaacagcc 1860 
tcaaccatca ggaaaacaag agtcaggttc agaaatccat gtggaagtga aggcacaaag 1920 
cttggttata agccctccag ctcctagtcc taggaaaact ccagttgcca gtgatcaacg 1980 
ccgtaggtcc tgcaaaacag cccctgcttc cagcagcaaa tctcagacag aggttcctaa 2040 
gagaggagga gaaagagtgg caacctgcct tcaaaagaga gtgtctatca gccgaagtca 2100 
acatgatatt ttacagatga tatgttccaa aagaagaagt ggtgcttcgg aagcaaatct 2160 
gattgttgca aaatcatggg cagatgtagt aaaacttggt gcaaaacaaa cacaaactaa 2220 
agtcataaaa catggtcctc aaaggtcaat gaacaaaagg caaagaagac ctgctactcc 2280 
aaagaagcct gtgggcgaag ttcacagtca atttagtaca ggccacgcaa actctccttg 2340 
taccataata atagggaaag ctcatactga aaaagtacat gtgcctgctc gaccctacag 2400 
agtgctcaac aacttcattt ccaaccaaaa aatggacttt aaggaagatc tttcaggaat 2460 
agctgaaatg ttcaagaccc cagtgaagga gcaaccgcag ttgacaagca catgtcacat 2520 
cgctatttca aattcagaga atttgcttgg aaaacagttt caaggaactg attcaggaga 2580 
agaacctctg ctccccacct cagagagttt tggaggaaat gtgttcttca gtgcacagaa 2640 
tgcagcaaaa cagccatctg ataaatgctc tgcaagccct cccttaagac ggcagtgtat 2700 
tagagaaaat ggaaacgtag caaaaacgcc caggaacacc tacaaaatga cttctctgga 2760 
gacaaaaact tcagatactg agacagagcc ttcaaaaaca gtatccactg taaacaggtc 2820 
aggaaggtct acagagttca ggaatataca gaagctacct gtggaaagta agagtgaaga 2880 
aacaaataca gaaattgttg agtgcatcct aaaaagaggt cagaaggcaa cactactaca 2940 
acaaaggaga gaaggagaga tgaaggaaat agaaagacct tttgagacat ataaggaaaa 3000 
tattgaatta aaagaaaacg atgaaaagat gaaagcaatg aagagatcaa gaacttgggg 3060 
gcagaaatgt gcaccaatgt ctgacctgac agacctcaag agcttgcctg atacagaact 3120 
catgaaagac acggcacgtg gccagaatct cctccaaacc caagatcatg ccaaggcacc 3180 
aaagagtgag aaaggcaaaa tcactaaaat gccctgccag tcattacaac cagaaccaat 3240 
aaacacccca acacacacaa aacaacagtt gaaggcatcc ctggggaaag taggtgtgaa 3300 
agaagagctc ctagcagtcg gcaagttcac acggacgtca ggggagacca cgcacacgca 3360 
cagagagcca gcaggagatg gcaagagcat cagaacgttt aaggagtctc caaagcagat 3420 
cctggaccca gcagcccgtg taactggaat gaagaagtgg ccaagaacgc ctaaggaaga 3480 
ggcccagtca ctagaagacc tggctggctt caaagagctc ttccagacac caggtccctc 3540 
tgaggaatca atgactgatg agaaaactac caaaatagcc tgcaaatctc caccaccaga 3600 
atcagtggac actccaacaa gcacaaagca atggcctaag agaagtctca ggaaagcaga 3660 
tgtagaggaa gaattcttag cactcaggaa actaacacca tcagcaggga aagccatgct 3720 
tacgcccaaa ccagcaggag gtgatgagaa agacattaaa gcatttatgg gaactccagt 3780 
gcagaaactg gacctggcag gaactttacc tggcagcaaa agacagctac agactcctaa 3840 
ggaaaaggcc caggctctag aagacctggc tggctttaaa gagctcttcc agactcctgg 3900 
tcacaccgag gaattagtgg ctgctggtaa aaccactaaa ataccctgcg actctccaca 3960 
gtcagaccca gtggacaccc caacaagcac aaagcaacga cccaagagaa gtatcaggaa 4020 
agcagatgta gagggagaac tcttagcgtg caggaatcta atgccatcag caggcaaagc 4080 
catgcacacg cctaaaccat cagtaggtga agagaaagac atcatcatat ttgtgggaac 4140 
tccagtgcag aaactggacc tgacagagaa cttaaccggc agcaagagac ggccacaaac 4200 
tcctaaggaa gaggcccagg ctctggaaga cctgactggc tttaaagagc tcttccagac 4260 
ccctggtcat actgaagaag cagtggctgc tggcaaaact actaaaatgc cctgcgaatc 4320 
ttctccacca gaatcagcag acaccccaac aagcacaaga aggcagccca agacaccttt 4380 
ggagaaaagg gacgtacaga aggagctctc agccctgaag aagctcacac agacatcagg 4440 
ggaaaccaca cacacagata aagtaccagg aggtgaggat aaaagcatca acgcgtttag 4500 
ggaaactgca aaacagaaac tggacccagc agcaagtgta actggtagca agaggcaccc 4560 
aaaaactaag gaaaaggccc aacccctaga agacctggct ggctggaaag agctcttcca 4620 
gacaccagta tgcactgaca agcccacgac tcacgagaaa actaccaaaa tagcctgcag 4680 
atcacaacca gacccagtgg acacaccaac aagctccaag ccacagtcca agagaagtct 4740 
caggaaagtg gacgtagaag aagaattctt cgcactcagg aaacgaacac catcagcagg 4800 
caaagccatg cacacaccca aaccagcagt aagtggtgag aaaaacatct acgcatttat 4860 
gggaactcca gtgcagaaac tggacctgac agagaactta actggcagca agagacggct 4920 
acaaactcct aaggaaaagg cccaggctct agaagacctg gctggcttta aagagctctt 4980 
ccagacacga ggtcacactg aggaatcaat gactaacgat aaaactgcca aagtagcctg 5040 
caaatcttca caaccagacc tagacaaaaa cccagcaagc tccaagcgac ggctcaagac 5100 
atccctgggg aaagtgggcg tgaaagaaga gctcctagca gttggcaagc tcacacagac 5160 
atcaggagag actacacaca cacacacaga gccaacagga gatggtaaga gcatgaaagc 5220 
atttatggag tctccaaagc agatcttaga ctcagcagca agtctaactg gcagcaagag 5280 
gcagctgaga actcctaagg gaaagtctga agtccctgaa gacctggccg gcttcatcga 5340 
gctcttccag acaccaagtc acactaagga atcaatgact aatgaaaaaa ctaccaaagt 5400 
atcctacaga gcttcacagc cagacctagt ggacacccca acaagctcca agccacagcc 5460 
caagagaagt ctcaggaaag cagacactga agaagaattt ttagcattta ggaaacaaac 5520 
gccatcagca ggcaaagcca tgcacacacc caaaccagca gtaggtgaag agaaagacat 5580 
caacacgttt ttgggaactc cagtgcagaa actggaccag ccaggaaatt tacctggcag 5640 
caatagacgg ctacaaactc gtaaggaaaa ggcccaggct ctagaagaac tgactggctt 5700 
cagagagctt ttccagacac catgcactga taaccccaca gctgatgaga aaactaccaa 5760 
aaaaatactc tgcaaatctc cgcaatcaga cccagcggac accccaacaa acacaaagca 5820 
acggcccaag agaagcctca agaaagcaga cgtagaggaa gaatttttag cattcaggaa 5880 
actaacacca tcagcaggca aagccatgca cacgcctaaa gcagcagtag gtgaagagaa 5940 
agacatcaac acatttgtgg ggactccagt ggagaaactg gacctgctag gaaatttacc 6000 
tggcagcaag agacggccac aaactcctaa agaaaaggcc aaggctctag aagatctggc 6060 
tggcttcaaa gagctcttcc agacaccagg tcacactgag gaatcaatga ccgatgacaa 6120 
aatcacagaa gtatcctgca aatctccaca accagaccca gtcaaaaccc caacaagctc 6180 
caagcaacga ctcaagatat ccttggggaa agtaggtgtg aaagaagagg tcctaccagt 6240 
cggcaagctc acacagacgt cagggaagac cacacagaca cacagagaga cagcaggaga 6300 
tggaaagagc atcaaagcgt ttaaggaatc tgcaaagcag atgctggacc cagcaaacta 6360 
tggaactggg atggagaggt ggccaagaac acctaaggaa gaggcccaat cactagaaga 6420 
cctggccggc ttcaaagagc tcttccagac accagaccac actgaggaat caacaactga 6480 
tgacaaaact accaaaatag cctgcaaatc tccaccacca gaatcaatgg acactccaac 6540 
aagcacaagg aggcggccca aaacaccttt ggggaaaagg gatatagtgg aagagctctc 6600 
agccctgaag cagctcacac agaccacaca cacagacaaa gtaccaggag atgaggataa 6660 
aggcatcaac gtgttcaggg aaactgcaaa acagaaactg gacccagcag caagtgtaac 6720 
tggtagcaag aggcagccaa gaactcctaa gggaaaagcc caacccctag aagacttggc 6780 
tggcttgaaa gagctcttcc agacaccagt atgcactgac aagcccacga ctcacgagaa 6840 
aactaccaaa atagcctgca gatctccaca accagaccca gtgggtaccc caacaatctt 6900 
caagccacag tccaagagaa gtctcaggaa agcagacgta gaggaagaat ccttagcact 6960 
caggaaacga acaccatcag tagggaaagc tatggacaca cccaaaccag caggaggtga 7020 
tgagaaagac atgaaagcat ttatgggaac tccagtgcag aaattggacc tgccaggaaa 7080 
tttacctggc agcaaaagat ggccacaaac tcctaaggaa aaggcccagg ctctagaaga 7140 
cctggctggc ttcaaagagc tcttccagac accaggcact gacaagccca cgactgatga 7200 
gaaaactacc aaaatagcct gcaaatctcc acaaccagac ccagtggaca ccccagcaag 7260 
cacaaagcaa cggcccaaga gaaacctcag gaaagcagac gtagaggaag aatttttagc 7320 
actcaggaaa cgaacaccat cagcaggcaa agccatggac accccaaaac cagcagtaag 7380 
tgatgagaaa aatatcaaca catttgtgga aactccagtg cagaaactgg acctgctagg 7440 
aaatttacct ggcagcaaga gacagccaca gactcctaag gaaaaggctg aggctctaga 7500 
ggacctggtt ggcttcaaag aactcttcca gacaccaggt cacactgagg aatcaatgac 7560 
tgatgacaaa atcacagaag tatcctgtaa atctccacag ccagagtcat tcaaaacctc 7620 
aagaagctcc aagcaaaggc tcaagatacc cctggtgaaa gtggacatga aagaagagcc 7680 
cctagcagtc agcaagctca cacggacatc aggggagact acgcaaacac acacagagcc 7740 
aacaggagat agtaagagca tcaaagcgtt taaggagtct ccaaagcaga tcctggaccc 7800 
agcagcaagt gtaactggta gcaggaggca gctgagaact cgtaaggaaa aggcccgtgc 7860 
tctagaagac ctggttgact tcaaagagct cttctcagca ccaggtcaca ctgaagagtc 7920 
aatgactatt gacaaaaaca caaaaattcc ctgcaaatct cccccaccag aactaacaga 7980 
cactgccacg agcacaaaga gatgccccaa gacacgtccc aggaaagaag taaaagagga 8040 
gctctcagca gttgagaggc tcacgcaaac atcagggcaa agcacacaca cacacaaaga 8100 
accagcaagc ggtgatgagg gcatcaaagt attgaagcaa cgtgcaaaga agaaaccaaa 8160 
cccagtagaa gaggaaccca gcaggagaag gccaagagca cctaaggaaa aggcccaacc 8220 
cctggaagac ctggccggct tcacagagct ctctgaaaca tcaggtcaca ctcaggaatc 8280 
actgactgct ggcaaagcca ctaaaatacc ctgcgaatct cccccactag aagtggtaga 8340 
caccacagca agcacaaaga ggcatctcag gacacgtgtg cagaaggtac aagtaaaaga 8400 
agagccttca gcagtcaagt tcacacaaac atcaggggaa accacggatg cagacaaaga 8460 
accagcaggt gaagataaag gcatcaaagc attgaaggaa tctgcaaaac agacaccggc 8520 
tccagcagca agtgtaactg gcagcaggag acggccaaga gcacccaggg aaagtgccca 8580 
agccatagaa gacctagctg gcttcaaaga cccagcagca ggtcacactg aagaatcaat 8640 
gactgatgac aaaaccacta aaataccctg caaatcatca ccagaactag aagacaccgc 8700 
aacaagctca aagagacggc ccaggacacg tgcccagaaa gtagaagtga aggaggagct 8760 
gttagcagtt ggcaagctca cacaaacctc aggggagacc acgcacaccg acaaagagcc 8820 
ggtaggtgag ggcaaaggca cgaaagcatt taagcaacct gcaaagcgga acgtggacgc 8880 
agaagatgta attggcagca ggagacagcc aagagcacct aaggaaaagg cccaacccct 8940 
ggaagacctg gccagcttcc aagagctctc tcaaacacca ggccacactg aggaactggc 9000 
aaatggtgct gctgatagct ttacaagcgc tccaaagcaa acacctgaca gtggaaaacc 9060 
tctaaaaata tccagaagag ttcttcgggc ccctaaagta gaacccgtgg gagacgtggt 9120 
aagcaccaga gaccctgtaa aatcacaaag caaaagcaac acttccctgc ccccactgcc 9180 
cttcaagagg ggaggtggca aagatggaag cgtcacggga accaagaggc tgcgctgcat 9240 
gccagcacca gaggaaattg tggaggagct gccagccagc aagaagcaga gggttgctcc 9300 
cagggcaaga ggcaaatcat ccgaacccgt ggtcatcatg aagagaagtt tgaggacttc 9360 
tgcaaaaaga attgaacctg cggaagagct gaacagcaac gacatgaaaa ccaacaaaga 9420 
ggaacacaaa ttacaagact cggtccctga aaataaggga atatccctgc gctccagacg 9480 
ccaagataag actgaggcag aacagcaaat aactgaggtc tttgtattag cagaaagaat 9540 
agaaataaac agaaatgaaa agaagcccat gaagacctcc ccagagatgg acattcagaa 9600 
tccagatgat ggagcccgga aacccatacc tag^gacaaa gtcactgaga acaaaaggtg 9660 
cttgaggtct gctagacaga atgagagctc ccagcctaag gtggcagagg agagcggagg 9720 
gcagaagagt gcgaaggttc tcatgcagaa tcagaaaggg aaaggagaag caggaaattc 9780 
agactccatg tgcctgagat caagaaagac aaaaagccag cctgcagcaa gcactttgga 9840 
gagcaaatct gtgcagagag taacgcggag tgtcaagagg tgtgcagaaa atccaaagaa 9900 
ggctgaggac aatgtgtgtg tcaagaaaat aacaaccaga agtcataggg acagtgaaga 9960 
tatttgacag aaaaatcgaa ctgggaaaaa tataataaag ttagttttgt gataagttct 10020 
agtgcagttt ttgtcataaa ttacaagtga attctgtaag taaggctgtc agtctgctta 10080 
agggaagaaa actttggatt tgctgggtct gaatcggctt cataaactcc actgggagca 10140 
ctgctgggct cctggactga gaatagttga acaccggggg ctttgtgaag gagtctgggc 10200 
caaggtttgc cctcagcttt gcagaatgaa gccttgaggt ctgtcaccac ccacagccac 10260 
cctacagcag ccttaactgt gacacttgcc acactgtgtc gtcgtttgtt tgcctatgtt 10320 
ctccagggca cggtggcagg aacaactatc ctcgtctgtc ccaacactga gcaggcactc 10380 
ggtaaacacg aatgaatgga taagcgcacg gatgaatgga gcttacaaga tctgtctttc 10440 
caatggccgg gggcatttgg tccccaaatt aaggctattg gacatctgca caggacagtc 10500 
ctatttttga tgtcctttcc tttctgaaaa taaagttttg tgctttggag aatgactcgt 10560 
gagcacatct ttagggacca agagtgactt tctgtaagga gtgactcgtg gcttgccttg 10620 
gtctcttggg aatacttttc taactagggt tgctctcacc tgagacattc tccacccgcg 10680 
gaatctcagg gtcccaggct gtgggccatc acgacctcaa actggctcct aatctccagc 10740 
tttcctgtca ttgaaagctt cggaagttta ctggctctgc tcccgcctgt tttctttctg 10800 
actctatctg gcagcccgat gccacccagt acaggaagtg acaccagtac tctgtaaagc 10860 
atcatcatcc ttggagagac tgagcactca gcaccttcag ccacgatttc aggatcgctt 10920 
ccttgtgagc cgctgcctcc gaaatctcct ttgaagccca gacatctttc tccagcttcaL 10980 
gacttgtaga tataactcgt tcatcttcat ttactttcca ctttgccccc tgtcctctct 11040 
gtgttcccca aatcagagaa tagcccgcca tcccccagat cacctgtctg gattcctccc 11100 
cattcaccca ccttgccagg tgcaggtgag gatggtgcac cagacagggt agctgtcccc 11160 
caaaatgtgc cctgtgcggg cagtgccctg tctccacgtt tgtttcccca gtgtctggcg 11220 
gggagccagg tgacatcata aatacttgct gaatgaatgc agaaatcagc ggtactgact 11280 
tgtactatat tggctgccat gatagggttc tcacagcgtc atccatgatc gtaagggaga 11340 
atgacattct gcttgaggga gggaatagaa aggggcaggg aggggacatc tgagggcttc 11400 
acagggctgc aaagggtaca gggattgcac cagggcagaa caggggaggg tgttcaagga 11460 
agagtggctc ttagcagagg cactttggaa ggtgtgaggc ataaatgctt ccttctacgt 11520 
aggccaacct caaaactttc agtaggaatg ttgctatgat caagttgttc taacacttta 11580 
gacttagtag taattatgaa cctcacatag aaaaatttca tccagccata tgcctgtgga 11640 
gtggaatatt ctgtttagta gaaaaatcct ttagagttca gctctaacca gaaatcttgc 11700 
tgaagtatgt cagcaccttt tctcaccctg gtaagtacag tatttcaaga gcacgctaag 11760 
ggtggttttc attttacagg gctgttgatg atgggttaaa aatgttcatt taagggctac 11820 
ccccgtgttt aatagatgaa caccacttct acacaaccct ccttggtact gggggaggga 11880 
gagatctgac aaatactgcc cattccccta ggctgactgg atttgagaac aaatacccac 11940 
ccatttccac catggtatgg taacttctct gagcttcagt ttccaagtga atttccatgt 12000 
aataggacat tcccattaaa tacaagctgt ttttactttt tcgcctccca gggcctgtgc 12060 
gatctggtcc cccagcctct cttgggcttt cttacactaa ctctgtacct accatctcct 12120 
gcctccctta ggcaggcacc tccaaccacc acacactccc tgctgttttc cctgcctgga 12180 
actttcccac cagccccacc aagatcattt catccagtcc tgagctcagc ttaagggagg 12240 
cttcttgcct gtgggttccc tcacccccat gcctgtcctc caggctgggg caggttctta 12300 
gtttgcctgg aattgttctg tacctctttg tagcacgtag tgttgtgaaa ctaagccact 12360 
aattgagttt ctggctcccc tcctggggtt gtaagttttg ttcattcatg agggccgact 12420 
gtatttcctg gttactgtat cccagtgacc agccacagga gatgtccaat aaagtatgtg 12480 
atgaaatggt cttaaaaaaa aaaaaaaaaa aaaaa 12515 



<210> 314 

<211> 2444 

<212> DNA 

<213> Homo sapiens 
<400> 314 
ggcacgaggc ggggccgggt cgcagctggg cccgcggcat ggacgaactg ttccccctca 60 
tcttcccggc agagcagccc aagcagcggg gcatgcgctt ccgctacaag tgcgaggggc 120 
gctccgcggg cagcatccca ggcgagagga gcacagatac caccaagacc caccccacca 180 
tcaagatcaa tggctacaca ggaccaggga cagtgcgcat ctccctggtc accaaggacc 240 
ctcctcaccg gcctcacccc cacgagcttg taggaaagga ctgccgggat ggcttctatg 300 
aggctgagct ctgcccggac cgctgcatcc acagtttcca gaacctggga atccagtgtg 360 
tgaagaagcg ggacctggag caggctatca gtcagcgcat ccagaccaac aacaacccct 420 
tccaagttcc tatagaagag cagcgtgggg actacgacct gaatgctgtg cggctctgct 480 
tccaggtgac agtgcgggac ccatcaggca ggcccctccg cctgccgcct gtcctttctc 540 
atcccatctt tgacaatcgt gcccccaaca ctgccgagct caagatctgc cgagtgaacc 600 
gaaactctgg cagctgcctc ggtggggatg agatcttcct actgtgtgac aaggtgcaga 660 
aagaggacat tgaggtgtat ttcacgggac caggctggga ggcccgaggc tccttttcgc 720 
aagctgatgt gcaccgacaa gtggccattg tgttccggac ccctccctac gcagacccca 780 
gcctgcaggc tcctgtgcgt gtctccatgc agctgcggcg gccttccgac cgggagctca 840 
gtgagcccat ggaattccag tacctgccag atacagacga tcgtcaccgg attgaggaga 900 
aacgtaaaag gacatatgag accttcaaga gcatcatgaa gaagagtcct ttcagcggac 960 
ccaccgaccc ccggcctcca cctcgacgca ttgctgtgcc ttcccgcagc tcagcttctg 1020 
tccccaagcc agcaccccag ccctatccct ttacgtcatc cctgagcacc atcaactatg 1080 
atgagtttcc caccatggtg tttccttctg ggcagatcag ccaggcctcg gccttggccc 1140 
cggcccctcc ccaagtcctg ccccaggctc cagcccctgc ccctgctcca gccatggtat 1200 
cagctctggc ccaggcccca gcccctgtcc cagtcctagc cccaggccct cctcaggctg 1260 
tggccccacc tgcccccaag cccacccagg ctggggaagg aacgctgtca gaggccctgc 1320 
tgcagctgca gtttgatgat gaagacctgg gggccttgct tggcaacagc acagacccag 1380 
ctgtgttcac agacctggca tccgtcgaca actccgagtt tcagcagctg ctgaaccagg 1440 
gcatacctgt ggccccccac acaactgagc ccatgctgat ggagtaccct gaggctataa 1500 
ctcgcctagt gacagcccag aggccccccg acccagctcc tgctccactg ggggccccgg 1560 
ggctccccaa tggcctcctt tcaggagatg aagacttctc ctccattgcg gacatggact 1620 
tctcagccct gctgagtcag atcagctcct aagggggtga cgcctgccct ccccagagca 1680 
ctggttgcag gggattgaag ccctccaaaa gcacttacgg attctggtgg ggtgtgttcc 1740 
aactgccccc aactttgtgg atgtcttcct tggagggggg agccatattt tattctttta 1800 
ttgtcagtat ctgtatctct ctctcttttt ggaggtgctt aagcagaagc attaacttct 1860 
ctggaaaggg gggagctggg gaaactcaaa cttttcccct gtcctgatgg tcagctccct 1920 
tctctgtagg gaactgtggg gtcccccatc cccatcctcc agcttctggt actctcctag 1980 
agacagaagc aggctggagg taaggccttt gagcccacaa agccttatca agtgtcttcc 2040 
atcatggatt cattacagct taatcaaaat aacgccccag ataccagccc ctgtatggca 2100 
ctggcattgt ccctgtgcct aacaccagcg tttgaggggc tgccttcctg ccctacagag 2160 
gtctctgccg gctctttcct tgctcaacca tggctgaagg aaacagtgca acagcactgg 2220 
ctctctccag gatccagaag gggtttggtc tggacttcct tgctctcccc tcttctcaag 2280 
tgccttaata gtagggtaag ttgttaagag tgggggagag caggctggca gctctccagt 2340 
caggaggcat agtttttagt gaacaatcaa agcacttgga ctcttgctct ttctactctg 2400 
aactaataaa gctgttgcca agctggacgg cacgagctcg tgcc 2444 



<210> 315 
<211> 732 
<212> DNA 

<213> Homo sapiens 



<400> 315 

tgctgcgaac cacgtgggtc ccgggcgcgt ttcgggtgct ggcggctgca gccggagttc 60 
aaacctaagc agctggaagg aaccatggcc aactgtgagc gtaccttcat tgcgatcaaa 120 
ccagatgggg tccagcgggg tcttgtggga gagattatca agcgttttga gcagaaagga 180 
ttccgccttg ttggtctgaa attcatgcaa gcttccgaag atcttctcaa ggaacactac 240 
gttgacctga aggaccgtcc attctttgcc ggcctggtga aatacatgca ctcagggccg 300 
gtagttgcca tggtctggga ggggctgaat gtggtgaaga cgggccgagt catgctcggg 360 
gagaccaacc ctgcagactc caagcctggg accatccgtg gagacttctg catacaagtt 420 
ggcaggaaca ttatacatgg cagtgattct gtggagagtg cagagaagga gatcggcttg 480 
tggtttcacc ctgaggaact ggtagattac acgagctgtg ctcagaactg gatctatgaa 540 
tgacaggagg gcagaccaca ttgcttttca catccatttc ccctccttcc catgggcaga 600 
ggaccaggct gtaggaaatc tagttattta caggaacttc atcataattt ggagggaagc 660 
tcttggagct gtgagttctc cctgtacagt gttaccatcc ccgaccatct gattaaaatg 720 
cttcctccca 732 



<210> 316 
<211> 2422 
<212> DNA 

<213> Homo sapiens 
<400> 316

gtcagcctcc cttccaccgc catattgggc cactaaaaaa agggggctcg tcttttcggg 60 
gtgtttttct ccccctcccc tgtccccgct tgctcacggc tctgcgactc cgacgccggc 120 
aagqtttgga gagcggctgg gttcgcggga cccgcgggct tgcacccgcc cagactcgga 180 
cgggctttgc caccctctcc gcttgcctgg tcccctctcc tctccgccct cccgctcgcc 240 
agtccatttg atcagcggag actcggcggc cgggccgggg cttccccgca gcccctgcgc 300 
gctcctagag ctcgggccgt ggctcgtcgg ggtctgtgtc ttttggctcc gagggcagtc 360 
gctgggcttc cgagaggggt tcgggccgcg taggggcgct ttgttttgtt cggttttgtt 420 
tttttgagag tgcgagagag gcggtcgtgc agacccggga gaaaigatgtc aaacgtgcga 480 
gtgtctaacg ggagccctag cctggagcgg atggacgcca ggcaggcgga gcaccccaag 540 
ccctcggcct gcaggaacct cttcggcccg gtggaccacg aagagttaac ccgggacttg 600 
gagaagcact gcagagacat ggaagaggcg agccagcgca agtggaattt cgattttcag 660 
aatcacaaac ccctagaggg caagtacgag tggcaagagg tggagaaggg cagcttgccc 720 
gagttctact acagaccccc gcggcccccc aaaggtgcct gcaaggtgcc ggcgcaggag 780 
agccaggatg tcagcgggag ccgcccggcg gcgcctttaa ttggggctcc ggctaactct 840 
gaggacacgc atttggtgga cccaaagact gatccgtcgg acagccagac ggggttagcg 900 
gagcaatgcg caggaataag gaagcgacct gcaaccgacg attcttctac tcaaaacaaa 960 
agagccaaca gaacagaaga aaatgtttca gacggttccc caaatgccgg ttctgtggag 1020 
cagacgccca agaagcctgg cctcagaaga cgtcaaacgt aaacagctcg aattaagaat 1080 
atgtttcctt gtttatcaga tacatcactg cttgatgaag caaggaagat atacatgaaa 1140 
attttaaaaa tacatatcgc tgacttcatg gaatggacat cctgtataag cactgaaaaa 1200 
caacaacaca ataacactaa aattttaggc actcttaaat gatctgcctc taaaagcgtt 1260 
ggatgtagca ttatgcaatt aggtttttcc ttatttgctt cattgtacta cctgtgtata 1320 
tagtttttac cttttatgta gcacataaac tttggggaag ggagggcagg gtggggctga 1380 
ggaactgacg tggagcgggg tatgaagagc ttgctttgat ttacagcaag tagataaata 1440 
rttgacttgc atgaagagaa gcaattttgg ggaagggttt gaattgtttt etttaaagat 1500 
qtaatgtccc tttcagagac agctgatact tcatttaaaa aaatcacaaa aatttgaaca 1560 
ctggctaaag ataattgcta tttattttta caagaagttt attctcattt gggagatctg 1620 
gtgatctccc aagctatcta aagtttgtta gatagctgca tgtggctttt ttaaaaaagc 1680 
aacagaaacc tatcctcact gccctcccca gtctctctta aagttggaat ttaccagtta 1740 
attactcagc agaatggtga tcactccagg tagtttgggg caaaaatccg aggtgcttgg 1800 
gagttttgaa tgttaagaat tgaccatctg cttttattaa atttgttgac aaaattttct 1860 
cattttcttt tcacttcggg ctgtgtaaac acagtcaaaa taattctaaa tccctcgata 1920 
tttttaaaga tctgtaagta acttcacatt aaaaaatgaa atatttttta atttaaagct 1980 
tactctgtcc atttatccac aggaaagtgt tatttttaaa ggaaggttca tgtagagaaa 2040 
agcacacttg taggataagt gaaatggata ctacatcttt aaacagtatt tcattgcctg 2100 
tgtatggaaa aaccatttga agtgtacctg tgtacataac tctgtaaaaa cactgaaaaa 2160 
ttatactaac ttatttatgt taaaagattt tttttaatct agacaatata caagccaaag 2220 
tggcatgttt tgtgcatttg taaatgctgt gttgggtaga ataggttttc ccctcttttg 2280 
ttaaataata tggctatgct taaaaggttg catactgagc caagtataat tttttgtaat 2340 
gtgtgaaaaa gatgccaatt attgttacac attaagtaat caataaagaa aacttccata 2400 
gctaaaaaaa aaaaaaaaaa aa 2422 

<210> 317 
<211> 5061 
<212> DNA 

<213> Homo sapiens 
<400> 317 

atggctcaga tatttagcaa cagcggattt aaagaatgtc cattttcaca tccggaacca 60 
acaagagcaa aagatgtgga caaagaagaa gcattacaga tggaagcaga ggctttagca 120 
aaactgcaaa aggatagaca agtgactgac aatcagagag gctttgagtt gtcaagcagc 180 
accagaaaaa aagcacaggt ttataacaag caggattatg atctcatggt gtttcctgaa 240 
tcagattccc aaaaaagagc attagatatt gatgtagaaa agctcaccca agctgaactt 300 
gagaaactat tgctggatga cagtttcgag actaaaaaaa cacctgtatt accagttact 360 
cctattctga gcccttcctt ttcagcacag ctctatttta gacctactat tcagagagga 420 
cagtggccac ctggattacc tgggccttcc acttatgctt taccttctat ttatccttct 480 
acttacagta aacaggctgc attccaaaat ggcttcaatc caagaatgcc cacttttcca 540 
tctacagaac ctatatattt aagtcttccg ggacaatctc catatttctc atatcctttg 600 
acacctgcca caccctttca tccacaagga agcttaccta tctatcgtcc agtagtcagt 660 
actgacatgg caaaactatt tgacaaaata gctagtacat cagaattttt aaaaaatggg 720 
aaagcaagga ctgatttgga gataacagat tcaaaagtca gcaatctaca ggtatctcca 780 
aagtctgagg atatcagtaa atttgactgg ttagacttgg atcctctaag taagcctaag 840 
gtggataatg tggaggtatt agaccatgag gaagagaaaa atgtttcaag tttgctagca 900 
aaggatcctt gggatgctgt tcttcttgaa gagagatcga cagcaaattg tcatcttgaa 960 
agaaaggtga atggaaaatc cctttctgtg gcaactgtta caagaagcca gtctttaaat 1020 
attcgaacaa ctcagcttgc aaaagcccag ggccatatat ctcagaaaga cccaaatggg 1080 
accagtagtt tgccaactgg aagttctctt cttcaagaag ttgaagtaca gaatgaggag 1140 
atggcagctt tttgtcgatc cattacaaaa ttgaagacca aatttccata taccaatcac 1200
cgcacaaacc eaggctattt qttaagtcca gtcacagcgc aaagaaacat atgcggagaa 1260
aatgctagtg tgaaggtctc cattgacatt gaaggatttc agctaccagt tacttttacg 1320
tgtgatgtga gttctactgt agaaatcatt ataatgcaag ccctttgctg ggtacatgat 1380
gacttgaatc aagtagatgt tggcagctat gttctaaaag tttgtggtca agaggaagtg 1440
ctgeagaata atcattgcct tggaagtcat gagcatattc aaaactgtcg aaaatgggac 1500
acagaaatta gactacaact cttgaccttc agtgcaatgt gtcaaaatct ggcccgaaca 1560
gcagaagatg atgaaacacc cgtggattta aacaaacacc tgtatcaaat agaaaaacct 1620
tgcaaagaag ccatgacgag acaccctgtt gaagaactct tagattctta tcacaaccaa 1680
gtagaactgg ctcttcaaat tgaaaaccaa caccgagcag tagatcaagt aattaaagct 1740
gtaagaaaaa tctgtagtgc tttagatggt gtcgagactc ttgccattac agaatcagta 1800
aagaagctaa agagagcagt taatctteca aggagtaaaa ctgctgatgt gacttctttg 1860
tttggaggag aagacactag caggagttca actaggggct cacttaatce tgaaaatcct 1920
gttcaagtaa gcataaacca attaactgca gcaatttatg atcttctcag actccatgca 1980
aattctggta ggagtcctac agactgtgcc caaagtagca agagtgtcaa ggaagcatgg 2040
actacaacag agcagctcca qtttactatt tttgctgctc atggaatttc aagtaattgg 2100
gtatcaaatt atgaaaaata ctacttgata tgttcactgt ctcacaatgg aaaggatctt 2160
tttaaaccta ttcaatcaaa qaaggttggc acttacaaga atttcttcta tcttattaaa 2220
tgggatgaac taatcatttt tcctatccag atatcacaat tgccattaga atcagttctt 2280
caccttactc tttttggaat tttaaatcag agcagtggaa gttcccctga ttctaataag 2340
cagagaaagg gaccagaagc tttgggcaaa gtttctttac ctctttgtga ctttagacgg 2400
tttttaacat gtggaactaa acttctatat ctttggactt catcacatac aaattctgtt 2460
cctggaacag ttaccaaaaa aggatatgtc atggaaagaa tagtgctaca ggttgatttt 2520
ccttctcctg catttgatat tatttataca actcctcaag ttgacagaag cattatacag 2580
caacataact tagaaacact agagaatgat atiaaaaggga aacttcttga tattcttcat 2640
aaagactcat cacttggact ttctaaagaa gataaagctt ttttatggga gaaacgttat 2700
tattgcttca aacacccaaa ttgtcttcct aaaatattag caagcgcccc aaactggaaa 2760
tggggtaatc ttgccaaaac ttactcattg cttcaccagt ggcctgcatt gtacccacta 2820
attgcattgg aacttcttga ttcaaaattt gctgatcagg aagtaagatc cctagctgtg 2880
acctggattg aggccattag tgatgatgag ctaacagatc ttcttccaca gtttgtacaa 2940
gctttgaaat atgaaattta cttgaatagt tcattagtgc aattcctttt gtccagggca 3000
ttgggaaata tccagatagc acacaattta tattggcttc tcaaagatgc cctgcatgat 3060
gtacagttta gtacccgata cgaacatgtt ttgggtgctc tcctgtcagt aggaggaaaa 3120
cgacttagag aagaacttct aaaacagacg aaacttgtac agcttttagg aggagtagca 3180
gaaaaagtaa ggcaggctag tggatcagcc agacaggttg ttctccaaag aagtatggaa 3240
cgagtacagt ccttttttca gaaaaataaa tgccgtctcc ctctcaagcc aagtctagtg 3300
gcaaaagaat taaatattaa qtcgtgttcc ttcttcagtt ctaatgctgt ccccctaaaa 3360
gtcacaatgg tgaatgctga ccctctggga gaagaaatta atgtcatgtt taaggttggt 3420
gaagatcttc ggcaagatat qttagcttta cagatgataa agattatgga taagatctgg 3480
cttaaagaag gactagatct gaggatggta attttcaaat gtctctcaac tggcagagat 3540
cgaggcatgg tggagctggt tcctgcttcc gataccctca ggaaaatcca agtggaatat 3600
ggtgtgacag gatcctttaa agataaacca cttgcagagt ggctaaggaa atacaatccc 3660
tctgaagaag aatatgaaaa qgcttcagag aactttatct attcctgtgc tggatgctgt 3720
gtagccacct atgttttagg catctgtgat cgacacaatg acaatataat gcttcgaagc 3780
acgggacaca tgtttcacat tgactttgga aagtttttgg gacatgcaca gatgtttggc 3840
agcttcaaaa gggategggc tccttttgtg ctgacctctg atatggcata tgtcattaat 3900
gggggtgaaa agcccaccat tcgttttcag ttgtttgtgg acctctgctg tcaggcctac 3960
aacttgataa gaaagcagac aaaccttttt cttaacctcc tttcactgat gattccttca 4020
gggttaccag aacttacaag tattcaagat ttgaaatacg ttagagatgc acttcaaccc 4080
caaactacag acgcagaagc tacaattttc tttactaggc ttattgaatc aagtttggga 4140
agcattgcca caaagtttaa cttcttcatt cacaaccttg ctcagcttcg tttttctggt 4200
cttccttcta atgatgagcc catcctttca ttttcaccta aaacatactc ctttagacaa 4260
gatggtcgaa tcaaggaagt ctctgttttt acatatcata agaaatacaa cccagataaa 4320
cattatattt atgtagtccg aattttgtgg gaaggacaga ttgaaccatc atttgtcttc 4380
cgaacatttg tcgaatttca ggaacttcac aataagctca gtattatttt tccactttgg 4440
aagttaccag gctttcctaa taqgatggtt ctaggaagaa cacacataaa agatgtagca 4500
gccaaaagga aaattgagtt aaacagttac ttacagagtt tgatgaatgc ttcaacggat 4560
gtagcagagt gtgatcttgt ttgtactttc ttccaccctt tacttcgtga tgagaaagct 4620
gaagggatag ctaggtctgc agatgcaggt tccttcagtc ctactccagg ccaaatagga 4680
ggagctgtga aattatccat ctcttaccga aatggtactc ttttcatcat ggtgatgcat 4740
atcaaagatc ttgttactga agatggagct gacccaaatc catatgtcaa aacataccta 4800
cttccagata accacaaaac atccaaacgt aaaaccaaaa tttcacgaaa aacgaggaat 4860
ccgacattca atgaaatgct tgtatacagt ggatatagca aagaaaccct aagacagcga 4920
gaacttcaac taagtgtact cagtgcagaa tctctgcggg agaatttttt cttgggtgga 4980
gtaaccctgc ctttgaaaga tttcaacttg agcaaagaga cggttaaatg gtatcagctg 5040
actgcggcaa catacttgta 5061






<210> 318 
<211> 3014 
<212> DNA 

<213> Homo sapiens 



<400> 318 
ctgaccagcg ccgccctccc ccgcccccga cccaggaggt ggagatccct ccggtccagc 60
cacattcaac acccactttc tcctccctct gcccctatat tcccgaaacc ccctcctcct 120
tcccttttcc ctcctccctg gagacggggg aggagaaaag gggagtccag tcgtcatgac 180
tgagctgaag gcaaagggtc cccgggctcc ccacgtggcg ggcggcccgc cctcccccga 240
ggtcggatcc ccactgctgt gtcgcccagc cgcaggtccg ttcccgggga gccagacctc 300
ggacaccttg cctgaagttt cggccatacc tatctccctg gacgggctac tcttccctcg 360
gccctgccag ggacaggacc cctccgacga aaagacgcag gaccagcagt cgctgtcgga 420
cgtggagggc gcatattcca gagctgaagc tacaaggggt gctggaggca gcagttctag 480
tcccccagaa aaggacagcg gactgctgga cagtgtcttg gacactctgt tggcgccctc 540
aggtcccggg cagagccaac ccagccctcc cgcctgcgag gtcaccagct cttggtgcct 600
gtttggcccc gaacttcccg aagatccacc ggctgccccc gccacccagc gggtgttgtc 660
cccgctcatg agccggtccg ggtgcaaggt tggagacagc tccgggacgg cagctgccca 720
taaagtgctg ccccggggcc tgtcaccagc ccggcagctg ctgctcccgg cctctgagag 780
ccctcactgg tccggggccc cagtgaagcc gtctccgcag gccgctgcgg tggaggttga 840
ggaggaggat ggctctgagt ccgaggagtc tgcgggtccg cttctgaagg gcaaacctcg 900
ggctctgggt ggcgcggcgg ctggaggagg agccgcggct gtcccgccgg gggcggcagc 960
aggaggcgtc gccctggtcc ccaaggaaga ttcccgcttc tcagcgccca gggtcgccct 1020
ggtggagcag gacgcgccga tggcgcccgg gcgctccccg ctggccacca cggtgangga 1080
tttcatccac gtgcctatcc tgcctctcaa tcacgcctta ttggcagccc gcactcggca 1140
gctgctggaa gacgaaagtt acgacggcgg ggccggggct gccagcgcct ttgccccgcc 1200
gcggagttca ccctgtgcct cgtccacccc ggtcgctgta ggcgacttcc ccgactgcgc 1260
gtacccgccc gacgccgagc ccaaggacga cgcgtaccct ctctatagcg acttccagcc 1320
gcccgctcta aagataaagg aggaggagga aggcgcggag gcctccgcgc gctccccgcg 1380
ttcctacctt gtggccggtg ccaaccccgc agccttcceg gatttcccgt tggggccacc 1440
gcccccgctg cxgccgcgag cgaecccatc cagacccggg gaagcggcgg tgacggccgc 1500
acccgccagt gcctcagtct cgtctgcgtc ctcctcgggg tcgaccctgg agtgcatcct 1560
gtacaaagcg gagggcgcgc cgccccagca gggcccgttc gcgccgccgc cctgcaaggc 1620
gccgggcgcg agcggctgcc tgctcccgcg ggacggcctg ccctccacct ccgcctctgc 1680
cgccgccgcc ggggcggccc ccgcgctcta ccctgcactc ggcctcaacg ggctcccgca 1740
gctcggctac caggccgccg tgctcaagga gggcctgccg caggtctacc cgccctatct 1800
caactacctg aggccggatt cagaagccag ccagagccca caatacagct tcgagtcatt 1860
acctcagaag atttgtttaa tctgtgggga tgaagcatca ggctgtcatt atggtgtcct 1920
tacctgtggg agctgtaagg tcttctttaa gagggcaatg gaagggcagc acaactactt 1980
atgtgctgga agaaatgact gcatcgttga taaaatccgc agaaaaaact gcccagcatg 2040
tcgccttaga aagtgctgtc aggctggeat ggtccttgga ggtcgaaaat ttaaaaagtt 2100
caataaagtc agagttgtga gagcactgga tgctgttgct ctcceacagc cagtgggcgt 2160
tccaaatgaa agccaagccc taagccagag attcactttt tcaccaggtc aagacataca 2220
gttgattcca ccactgatca acctgttaat gagcattgaa ccagatgtga tctatgcagg 2280
acatgacaac acaaaacctg acacetccag ttctttgctg acaagtctta atcaactagg 2340
cgagaggcaa cttctttcag tagtcaagtg gtctaaatca ttgccaggtt ttcgaaactt 2400
acatattgat gaccagataa ctctcattca gtattcttgg atgagcttaa tggtgtttgg 2460
tctaggatgg agatcctaca aacacgtcag tgggcagatg ctgtattttg cacctgatct 2520
aatactaaat gaacagcgga tgaaagaatc atcattctat tcattatgcc ttaccatgtg 2580
gcagatccca caggagtttg tcaagcttca agttagccaa gaagagttcc tctgtatgaa 2640
agtattgtta cttcttaata caattccttt ggaagggcta cgaagtcaaa cccagtttga 2700
ggagatgagg tcaagctaca ttagagagct catcaaggca attggtttga ggcaaaaagg 2760
agttgtgtcg agctcacagc gtttctatca acttacaaaa cttcttgata acttgcatga 2820
tcttgtcaaa caacttcatc tgtactgctt gaatacattt atccagtccc gggcacrgag 2880
tgttgaattt ccagaaatga tgtctgaagt tattgctgca caattaccca agatattggc 2940
agggatggtg aaaccccttc tctttcataa aaagtgaatg tcatcttttt cttttaaaga 3000
attaaatttt gtgg 3014



<210> 319 
<211> 2148 
<212> DNA 

<213> Homo sapiens 

<400> 319
qcttcagggt acagctcccc cgcagccaga agccgggcct gcagcgcctc agcaccgctc 60 
cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 120 
actctctgag gaaaaaccat tttgartatt actctcagac gtgcgtggca acaagtgact 180 
gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga 240
acgaaggcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg tgtggacaag 300
cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg ccctggccat 360
tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg cagcctttga 420
cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc 480
tctgggagtg ctgatgaagg gacaacatct tcacctggag accttcaaag ctgtgcttga 540
tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaae ttcaagtgct 600
ggatttacgg aagaactctc atcaggactt ctggactgta tggtctggaa acagggccag 660
tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc gaaaagtaga 720
tggtttgagc acagaggcag agcagccctt cattccagta gaggtgctcg tagacctgtt 780
cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag tgaagcgaaa 840
gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga 900
tatcaagatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg aagtgacttg 960
tacctggaag ctacccacct tggcgaaatt ttctccttac ctgggccaga tgattaatct 1020
gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg agaaggaaga 1080
gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc aggctctcta 1140
tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc acgtgatgaa 1200
ccccttggaa accctctcaa taactaactg ccggcttteg gaaggggatg tgatgcatct 1260
gtcccagagt cccagcgtca gtcagctaag tgtcctgagt ctaagtgggg tcatgctgac 1320
cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca ccctccagga 1380
cctggtcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc tgccttccct 1440
gagccactgc tcccagctta caaccttaag cttctacggg aattccatct ccatatctgc 1500
cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg tgctgtatcc 1560
tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga ggcttgccta 1620
tctgcatgcc aggcteaggg agttgctgtg tgagttgggg cggcccagca tggtctggct 1680
tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg agcccatcct 1740
gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca ttctgcatac 1800
ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc acagtttcag 1860
acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat tcagacaaat 1920
gttcagtgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg aggagttaat 1980
gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt tctaaaggga 2040
gattctggct tgggaagtac atgtaggagt taatccctgt gtagactgtt gtaaagaaac 2100
tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa 2148



<210> 320 
<211> 540 
<212> DNA 
<213> Homo sapiens 



<400> 320 

atccctgact cggggtcgcc tttggagcag agaggaggca atggccacca tggagaacaa 60
ggtgatctgc gccctggtcc tggtgtccat gctggccctc ggcaccctgg ccgaggccca 120
gacagagacg tgtacagtgg ccccccgtga aagacagaat tgtggttttc ctggtgtcac 180
gccctcccag tgtgcaaata agggctgctg tttcgacgac accgttcgtg gggtcccctg 240
gtgcttctat cctaatacca tcgacgtccc tccagaagag gagtgtgaat tttagacact 300
tctgcaggga tctgcctgca tcctgacggg gtgccgtccc cagcacggtg attagtccca 360
gagctcggct gccacctcca ccggacacct cagacacgct tctgcagctg tgcctcggct 420
cacaacacag attgactgct ctgactttga ctactcaaaa ttggcctaaa aattaaaaga 480
gatcgatatt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 540



<210> 321 

<211> 2346

<212> DNA 

<213> Homo sapiens 



<400> 321 

gcacgaggct gcggcgggtc cgggcccatg aggcgacgaa ggaggcggga cggcttttac 60
ccagccccgg acttccgaga cagggaagct gaggacatgg caggagtgtt tgacatagac 120
ctggaccagc cagaggacgc gggctctgag gatgagctgg aggagggggg tcagttaaat 180
gaaagcatgg accatggggg agttggacca tatgaacttg gcatggaaca ttgtgagaaa 240
tttgaaatct cagaaactag tgtgaacaga gggccagaaa aaatcagacc agaatgtttt 300
gagctacttc gggtacttgg taaagggggc tatggaaagg tttttcaagt acgaaaagta 360
acaggagcaa atactgggaa aatatttgcc atgaaggtgc ttaaaaaggc aatgatagta 420
agaaatgcta aagatacagc tcatacaaaa gcagaacgga atattctgga ggaagtaaag 480
catcccttca tcgtggattt aatttatgcc ttteagactg gtggaaaact ctacctcatc 540
cttgagtatc tcagtggagg agaactattt atgcagttag aaagagaggg aatatttatg 600
gaagacactg cctgctttta cttggcagaa atctccatgg ctttggggca tttacatcaa 660

aaggggatca tctacagaga cctgaagccg gagaatatca tgcttaatca ccaaggtcat 720
gtgaaactaa cagactttgg actatgeaaa gaatctattc atgatggaae agtcacacac 780 
acattttgtg gaacaataga atacatggcc cctgaaatct tgatgagaag tggccacaat 840 
cgtgctgtgg attggtggag tttgggagca ttaatgtatg acatgctgac tggagcaccc 900 
ccattcactg gggagaatag aaagaaaaca attgacaaaa tcctcaaatg taaactcaat 960 
ttgcctccct acctcacaca agaagccaga gatctgctta aaaagctgct gaaaagaaat 1020 
gctgcttctc gtctgggagc tggtcctggg gacgctggag aagttcaagc tcatccattc 1080 
tttagacaca ttaactggga agaacttctg gctcgaaagg tggagccccc ctttaaacct 1140 
ctgttgcaat ctgaagagga tgtaagtcag tttgattcca agtttacacg tcagacacet 1200 
gtcgacagcc cagatgactc aactctcagt gaaagtgcca atcaggtctt tctgggtttt 1260 
acatatgtgg ctccatctgt acttgaaagt gtgaaagaaa agttttcctt tgaaccaaaa 1320 
atccgatcac ctcgaagatt tattggcagc ccacgaacac ctgtcagccc agtcaaattt 1380 
tctcctgggg atttctgggg aagaggtgct tcggccagca cagcaaatcc tcagacacet 1440 
gtggaatacc caatggaaac aagtggcata gagcagatgg atgtgacaat gagtggggaa 1500 
gcatcggcac cacttccaat acgacagccg aactctgggc catacaaaaa acaagctttt 1560 
cccatgatct ccaaacggcc agagcacctg egtatgaate tatgacagag caatgetttt 1620 
aatgaattta aggcaaaaag gtggagaggg agatgtgtga gcatcctgca aggtgaaaca 1680 
agactcaaaa tgacagtttc agagagtcaa tgtcattaca tagaacactt eggacacagg 1740 
aaaaataaac gtggatttta aaaaatcaat caatggtgca aaaaaaaact taaagcaaaa 1800 
tagtattget gaactcttag gcacatcaat taattgattc ctcgcgacat ctttctcaac 1860 
cttatcaagg attttcatgt tgatgactcg aaactgacag tattaagggt aggatgttgc 1920 
tctgaatcac tgtgagtctg atgtgtgaag aagggtatcc tttcattagg caagtacaaa 1980
ttgcctataa tacttgeaac taaggacaaa ttagcatgea agcttggtca aacttttccc 2040 
aggcaaaatg ggaaggcaaa gacaaaagaa acttaccaat tgatgtttta cgtgcaaaca 2100 
acctgaatct tttttttata taaatatata tttttcaaat agatttttga ttcagctcat 2160 
tatgaaaaac atcccaaact ttaaaatgcg aaattattgg ttggtgtgaa gaaagecaga 2220 
caacttctgt ttcttctctt ggtgaaataa taaaatgcaa atgaatcatt gttaacacag 2280 
ctgtggctcg tttgagggat tggggtggac ctggggttta ttttcagtaa cccagctgcg 2340 
gagect 2346 

<210> 322 

<211> 2420 

<212> DNA 

<213> Homo sapiens 

<400> 322

teeggggegg cccccggcag ccagcgcgac gttccaaaat cgaacctcag tggeggeget 60 
eggaagegga actctgccgg ggccgcgccg gctacattgt ttcctccccc cgactccctc 120 
ccgccccctt cccccgcctt tcttccctcc gcgacccggg ccgtgcgtcc gtccccctgc 180 
ctctgcctgg cggtccctcc tcccctctcc ttgcacccat acctctttgt accgcacccc 240 
ctggggaccc ctgcgcccct cccctccccc ctgaccgcat ggaccgtccc gcaggccgct 300 
gatgccgccc gcggcgaggt ggcccggacc gcagtgcccc aagagagctc taatggtacc 360 
aagtgacagg ttggctttac tgtgactcgg ggacgecaga gctcctgaga agatgtcagc 420 
aatacaggee gcctggccat ceggtacaga atgtattgcc aagtacaact tccacggcac 480 
tgccgagcag gacctgccct tetgeaaagg agaegtgetc accattgtgg ccgtcaccaa 540 
ggaccccaac tggtacaaag ccaaaaacaa ggtgggccgt gagggcatca tcccagccaa 600 
ctacgtccag aagegggagg gcgtgaaggc gggtaccaaa ctcagcctca tgccttggtt 660 
ccacggcaag ateacaeggg agcaggctga gcggcttctg tacccgccgg agacaggect 720 
gttcctggtg egggagagea ccaactaccc eggagactae acgctgtgcg tgagctgega 780 
eggcaaggtg gagcactacc gcatcatgta ccatgccagc aagctcagca tcgacgagga 840 
ggtgtacttt gagaacctca tgcagctggt ggagcactac acctcagacg cagatggact 900 
ctgtacgcgc ctcattaaac caaaggtcat ggagggcaca gtggcggccc aggatgagtt 960 
ctaccgcagc ggctgggccc tgaacatgaa ggagctgaag ctgetgeaga ecategggaa 1020 
gggggagttc ggagacgtga tgctgggcga ttaccgaggg aacaaagtcg ccgtcaagtg 1080 
cattaagaac gacgccactg cccaggcctt cctggctgaa gcctcagtca tgaegcaact 1140 
geggcatage aacctggtgc agctcctggg cgtgatcgtg gaggagaagg gegggctcta 1200 
catcgtcact gagtacatgg ccaaggggag ccttgtggac tacctgeggt ctaggggtcg 1260 
gtcagtgctg ggeggagact gtctcctcaa gttctegcta gatgtctgcg aggccatgga 1320 
atacctggag ggcaacaatt tegtgeateg agacctggct gcccgcaatg tgctggtgtc 1380 
tgaggacaac gtggccaagg teagegaett tggtctcacc aaggaggcgt ccagcaccca 1440 
ggacaeggge aagctgccag tcaagtggac agcccctgag gecctgagag agaagaaatt 1500 
ctccactaag tctgacgtgt ggagtttcgg aatccttctc tgggaaatct actcctttgg 1560 
gcgagtgcct tatccaagaa ttcccctgaa ggacgtcgtc cctcgggtgg agaagggcta 1620 
caagatggat gcccccgacg gctgcccgcc cgcagtctat gaagtcatga agaactgetg 1680 
gcacctggac gccgccatgc ggccctcctt cctacagctc cgagagcagc ttgagcacat 1740 
caaaacccac gagctgcacc tgtgacggct ggcctccgcc tgggtcatgg gcctgtgggg 1800 
actgaacctg gaagatcatg gacctggtgc ccctgctcac tgggeccgag cctgaactga 1860 
gccccagcgg getggeggge ctttttcctg cgtcccagcc tgcacccctc cggccccgtc 1920 
tctcttggac ccacctgtgg ggcctgggga gcccactgag gggccaggga ggaaggaggc 1980
cacggagcgg pcqqcagcgc cccaccacgt cgggcttccc tggcctcccg ccactcgcct 2040
tcttagagtt ttattccttt ccttttttga gatttttttt ccgtgtgttt attttttatt 2100
atttttcaag ataaggagaa agaaagtacc cagcaaatgg gcattttaca agaagtacga 2160
atcttatttr tcctgtcctg cccgtgaggt gggggggacc gggcccctct ctagggaccc 2220
ctcgccccag cctcattccc cattctgtgt cccatgtccc gtgtctcctc ggtcgccccg 2280
tgtttgcgct tgaccatgtt gcactgtttg catgcgcccg aggcagacgt ctgtcagggg 2340
cttggatttc gtgtgccgct gccacccgcc cacccgcctt gtgagatgga atcgtaataa 2400
accacgccat gaggaaaaaa 2420



<210> 323 
<211> 2253 
<212> DNA 

<213> Homo sapiens 

<400> 323 
ggaagacttg ggtccttggg tcgcaggtgg gagccgacgg gtgggtagac cgtgggggat 60
atctcagtgg cggacgagga cggcggggac aaggggcggc tggtcggagt ggcggagcgt 120
caagtcccct gtcggttcct ccgtccctga gtgtccttgg cgctgccttg tgcccgccca 180
gbgcctttgc atccgctcct gggcaccgag gcgccctgta ggatactgct tgttacttat 240
tacagctaga ggeatcatgg accgatctaa agaaaactgc atttcaggac ctgttaaggc 300
tacagctcca gttggaggtc caaaacgtgt tctcgtgact cagcaaattc cttgtcagaa 360
tccattacct gtaaatagtg gccaggctca gcgggtcttg tgtccttcaa attcttccca 420
gcgcgttcct ttgcaagcac aaaagcttgt ctccagtcac aagccggttc agaatcagaa 480
gcagaagcaa ttgcaggcaa ccagtgtace tcatcctgtc tccaggccac tgaataacac 540
ccaaaagagc aagcagcccc tgccatcggc acctgaaaat aatcctgagg aggaactggc 600
atcaaaacag aaaaatgaag aatcaaaaaa gaggcagtgg gctttggaag actttgaaat 660
tggtcgccct ctgggtaaag gaaagtttgg taatgtttat ttggcaagag aaaagcaaag 720
caagtttatt ctggctctta aagtgttatt taaagctcag ctggagaaag ccggagtgga 780
gcatcagctc agaagagaag tagaaataca gtcccacctt cggcatccta atattcttag 840
actgtatggt tatttccatg atgctaccag agtctaecta attctggaat atgcaccact 900
tggaacagtt tatagagaac ttcagaaact ttcaaagttt gatgagcaga gaactgctac 960
ttatataaca gaattggcaa atgccctgtc ttactgtcat tcgaagagag ttattcatag 1020
agacattaag ccagagaact tacttcttgg atcagctgga gagcttaaaa ttgcagattt 1080
tgggtggtca gtacatgctc catcttccag gaggaccact ctctgtggca ccctggacta 1140
cctgccccct gaaatgattg aaggtcggat gcatgatgag aaggtggatc tctggagcct 1200
tggagttctt tgctatgaat ttttagttgg gaagcctcct tttgaggcaa acacatacca 1260
agagacctac aaaagaatat cacgggttga attcacattc cctgactttg taacagaggg 1320
agccagggac ctcatttcaa gactgttgaa gcataatccc agccagaggc caatgctcag 1380
agaagtactt gaacacccct ggatcacagc aaattcatca aaaccatcaa attgccaaaa 1440
caaagaatca gctagcaaac agtcttagga atcgtgcagg gggagaaatc cttgagccag 1500
ggctgccata taacctgaca ggaacatgct actgaagttt attttaccat tgactgctgc 1560
cctcaatcta gaacgctaca caagaaatat ttgttttact cagcaggtgt gccttaacct 1620
ccctattcag aaagctccac atcaataaac atgacactct gaagtgaaag tagccacgag 1680
aattgtgcta cttatactgg ttcataatct ggaggcaagg ttcgactgca gccgccccgt 1740
cagcctgtgc taggcatggt gtcttcacag gaggcaaatc cagagcctgg ctgtggggaa 1800
agtgaccact ctgccctgac cccgatcagt taaggagctg tgcaataacc ttcctagtac 1860
ctgagtgagt gtgtaactta ttgggttggc gaagcctggt aaagctgttg gaatgagtat 1920
gtgattcttt ttaagtatga aaataaagat atatgtacag acttgtattt tttctctggt 1980
ggcattcctt taggaatgct gtgtgtctgt ccggcacccc ggtaggcctg attgggtttc 2040
tagtcctcct taaccactta tctcccatat gagagtgtga aaaataggaa cacgtgctct 2100
acctccattt agggatttgc ttgggataca gaagaggcca tgtgtctcag agctgttaag 2160
ggcttatttt tttaaaacat tggagtcata gcatgtgtgt aaactttaaa tatgcaaata 2220
aataagtatc tatgtctaaa aaaaaaaaaa aaa 2253



<210> 324 
<211> 1619 
<212> DNA 

<213> Homo sapiens 



<400> 324 

ccgccagatt tgaatcgcgg gacccgttgg cagaggtggc ggcggcggca tgggtgcccc 60
gacgttgccc cctgcctggc agccctttct caaggaccac cgcatctcta cattcaagaa 120
ctggcccttc ttggagggct gcgcctgcac cccggagcgg atggccgagg ctggcttcat 180
ccactgcccc actgagaacg agccagactt ggcccagtgt ttcttctgct tcaaggagct 240
ggaaggctgg gagccagatg acgaccccat agaggaacat aaaaagcatt cgtccggttg 300
cgctttcctt tctgtcaaga agcagtttga agaattaacc cttggtgaat ttttgaaact 360
ggacagagaa aqagccaaga acaaaattgc aaaggaaacc aacaataaga agaaagaatt 420




tgaggaaact gcgaagaaag tgcgccgtgc catcgagcag ctggctgcca tggattgagg 480 
cctctggccg gagctgcctg gtcccagagt ggctgcacca cttccagggt ttattccctg 540 
gtgccaccag ccttcctgtg ggccccttag caatgtctta ggaaaggaga tcaacatttt 600 
caaattagat gtttcaactg tgctcctgtt ttgtcttgaa agtggcacca gaggtgcttc 660 
tgcctgtgca gcgggtgctg ctggtaacag tggctgcttc tctctctctc tctctttttt 720 
gggggctcat ttttgctgtt ttgattcccg ggcttaccag gtgagaagtg agggaggaag 780 
aaggcagtgt cccttttgct agagctgaca gctttgttcg cgtgggcaga gccttccaca 840 
gtgaatgtgt ctggacctca tgttgttgag gctgtcacag tcctgagtgt ggacttggca 900 
ggtgcctgtt gaatctgagc tgcaggttcc ttatctgtca cacctgtgcc tcctcagagg 960 
acagtttttt tgttgttgtg tttttttgtt tttttttttt ggtagatgca tgacttgtgt 1020 
gtgatgagag aatggagaca gagtccctgg ctcctctact gtttaacaac atggctttct 1080 
tattttgttt gaattgttaa ttcacagaat agcacaaact acaattaaaa ctaagcacaa 1140 
agccattcta agtcattggg gaaacggggt gaacttcagg tggatgagga gacagaatag 1200 
agtgatagga agcgtctggc agatactcct tttgccactg ctgtgtgatt agacaggccc 1260 
agtgagccgc ggggcacatg ctggccgctc ctccctcaga aaaaggcagt ggcctaaatc 1320 
ctttttaaat gacttggctc gatgctgtgg gggactggct gggctgctgc aggccgtgtg 1380 
tctgtcagcc caaccttcac atctgtcacg ttctccacac gggggagaga cgcagtccgc 1440 
ccaggtcccc gctttctttg gaggcagcag ctcccgcagg gctgaagtct ggcgtaagat 1500 
gatggatttg attcgccctc ctccctgtca tagagctgca gggtggattg ttacagcttc 1560 
gctggaaacc tctggaggtc atctcggctg ttcctgagaa ataaaaagcc tgtcatttc 1619 

<210> 325 

<211> 5010 

<212> DNA 

<213> Homo sapiens 

<400> 325 

ggcggctcgg gacggaggac gcgctagtgt gagtgcgggc ttctagaact acaccgaccc 60 
tcgtgtcctc ccttcatcct gcggggctgg ctggagcggc cgctccggtg ctgtccagca 120 
gccataggga gccgcacggg gagcgggaaa gcggtcgcgg ccccaggcgg ggcggccggg 180 
atggagcggg gccgcgagcc tgtggggaag gggctgtggc ggcgcctcga gcggctgcag 240 
gttcttctgt gtggcagttc agaatgatgg atcaagctag atcagcattc tctaacttgt 300 
ttggtggaga accattgtca tatacccggt tcagcctggc tcggcaagta gatggcgata 360 
acagtcatgt ggagatgaaa cttgctgtag atgaagaaga aaatgctgac aataacacaa 420 
aggccaatgt cacaaaacca aaaaggtgta gtggaagtat ctgctatggg actattgctg 480 
tgatcgtctt tttcttgatt ggatttatga ttggctactt gggctattgt aaaggggtag 540 
aaccaaaaac tgagtgtgag agactggcag gaaccgagtc tccagtgagg gaggagccag 600 
gagaggactt ccctgcagca cgtcgcttat attgggatga cctgaagaga aagttgtcgg 660 
agaaactgga cagcacagac ttcaccagca ccatcaagct gctgaatgaa aattcatatg 720 
tccctcgtga ggctggatct caaaaagatg aaaatcttgc gttgtatgtt gaaaatcaat 780 
ttcgtgaatt taaactcagc aaagtctggc gtgatcaaca ttttgttaag attcaggtca 840 
aagacagcgc tcaaaactcg gtgatcatag ttgataagaa cggtagactt gtttacctgg 900 
tggagaatcc tgggggttat gtggcgtata gtaaggctgc aacagttact ggtaaactgg 960
tccatgctaa ttttggtact aaaaaagatt ttgaggattt atacactcct gtgaatggat 1020 
ctatagtgat tgtcagagca gggaaaatca cctttgcaga aaaggttgca aatgctgaaa 1080 
gcttaaatgc aattggtgtg ttgatataca tggaccagac taaatttccc attgttaacg 1140 
cagaactttc attctttgga catgctcatc tggggacagg tgacccttac acacctggat 1200 
tcccttcctt caatcacact cagtttccac catctcggtc atcaggattg cctaatatac 1260 
ctgtccagac aatctccaga gctgctgcag aaaagctgtt tgggaatatg gaaggagact 1320 
gtccctctga ctggaaaaca gactctacat gtaggatggt aacctcagaa agcaagaatg 1380 
tgaagctcac tgtgagcaat gtgctgaaag agataaaaat tcttaacatc tttggagtta 1440 
ttaaaggctt tgtagaacca gatcactatg ttgtagttgg ggcccagaga gatgcatggg 1500 
gccctggagc tgcaaaatcc ggtgtaggca cagctctcct attgaaactt gcccagatgt 1560 
tctcagatat ggtcttaaaa gatgggtttc agcccagcag aagcattatc tttgccagtt 1620 
ggagtgctgg agactttgga tcggttggtg ccactgaatg gctagaggga tacctttcgt 1680 
ccctgcattt aaaggctttc acttatatta atctggataa agcggttctt ggtaccagca 1740 
acttcaaggt ttctgccagc ccactgttgt atacgcttat tgagaaaaca atgcaaaatg 1800 
tgaagcatcc ggttactggg caatttctat atcaggacag caactgggcc agcaaagttg 1860 
agaaactcac tttagacaat gctgctttcc ctttccttgc atattctgga atcccagcag 1920 
tttctttctg tttttgcgag gacacagatt atccttattt gggtaccacc atggacacct 1980 
ataaggaact gattgagagg attcctgagt tgaacaaagt ggcacgagca gctgcagagg 2040 
tcgctggtca gttcgtgatt aaactaaccc atgatgttga attgaacctg gactatgaga 2100 
ggtacaacag ccaactgctt tcatttgtga gggatctgaa ccaatacaga gcagacataa 2160 
aggaaatggg cctgagttta cagtggctgt attctgctcg tggagacttc ttccgtgcta 2220 
cttccagact aacaacagat ttcgggaatg ctgagaaaac agacagattt gtcatgaaga 2280 
aactcaatga tcgtgtcatg agagtggagt atcacttcct ctctccctac gtatctccaa 2340 
aagagtctcc tttccgacat gtcttctggg gctccggctc tcacacgctg ccagctttac 2400 
tggagaactt gaaactgcgt aaacaaaata acggtgcttt taatgaaacg ctgttcagaa 2460 
accagttggc tctagctact tggactattc agggagctgc aaatgccctc tctggtgacg 2520
tttgggacat tgacaatgag ttttaaatgt gatacccata gcttccatga gaacagcagg 2580
gtagtctggt ttctagactt gtgctgatcg tgctaaattt tcagtagggc tacaaaacct 2640
gatgttaaaa ttccatccca tcatcttggt actactagat gtctttaggc agcagctttt 2700
aatacagggt agataacctg tacttcaagt taaagtgaat aaccacttaa aaaatgtcca 2760
tgatggaata ttcccctatc tctagaattt taagtgcttt gtaatgggaa ctgcctcttt 2820
cctgttgttg ttaatgaaaa tgtcagaaac cagttatgtg aatgatctct ctgaatccta 2880
agggctggtc tctgctgaag gttgtaagtg gttcgcttac tttgagtgat cctccaactt 2940
catttgatgc taaataggag ataccaggtt gaaagacctc tccaaatgag atctaagcct 3000
ttccataagg aatgtagcag gtttcctcat tcctgaaaga aacagttaac tttcagaaga 3060
gatgggcttg ttttcttgcc aatgaggtct gaaatggagg tccttctgct ggataaaatg 3120
aggttcaact gttgattgca ggaataaggc cttaatatgt taacctcagt gtcatttatg 3180
aaaagagggg accagaagcc aaagacttag tatattttct tttcctctgt cccttccccc 3240
ataagcctcc atttagttct ttgttatttt tgtttcttcc aaagcacatt gaaagagaac 3300
cagtttcagg tgtttagttg cagactcagt ttgtcagact ttaaagaata atatgctgcc 3360
aaattttggc caaagtgtta atcttagggg agagctttct gtccttttgg cactgagata 3420
tttattgttt atttatcagt gacagagttc actataaatg gtgttttttt aatagaatat 3480
aattatcgga agcagtgcct tccataatta tgacagttat actgtcggtt ttttttaaat 3540
aaaagcagca tctgctaata aaacccaaca gatactggaa gttttgcatt tatggtcaac 3600
acttaagggt tttagaaaac agccgtcagc caaatgtaat tgaataaagt tgaagctaag 3660
atttagagat gaattaaatt taattagggg ttgctaagaa gcgagcactg accagataag 3720
aatgctggtt ttcctaaatg cagtgaattg tgaccaagtt ataaatcaat gtcacttaaa 3780
ggctgtggta gtactcctgc aaaattttat agctcagttt atccaaggtg taactctaat 3840
tcccatttgc aaaatttcca gtacctttgt cacaatccta acacattatc gggagcagtg 3900
tcttccataa tgtataaaga acaaggtagt ttttacctac cacagtgtct gtatcggaga 3960
cagtgatctc catatgttac actaagggtg taagtaatta tcgggaacag tgtttcccat 4020
aattttcttc atgcaatgac atcttcaaag cttgaagatc gttagtatct aacatgtatc 4080
ccaactccta taattcccta tcttttagtt ttagttgcag aaacattttg tggtcattaa 4140
gcattgggtg ggtaaattca accactgtaa aatgaaatta ctacaaaatt tgaaatttag 4200
cttgggtttt tgttaccttt atggtttctc caggtcctct acttaatgag atagcagcat 4260
acatttataa tgtttgctat tgacaagtca ttttaattta tcacattatt tgcatgttac 4320
ctcctataaa cttagtgcgg acaagtttta atccagaatt gaccttttga cttaaagcag 4380
agggactttg tatagaaggt ttgggggctg tggggaagga gagtcccctg aaggtctgac 4440
acgtctgcct acccattcgt ggtgatcaat taaatgtagg tatgaataag ttcgaagctc 4500
cgtgagtgaa ccatcatata aacgtgtagt acagctgttt gtcatagggc agttggaaac 4560
ggcctcctag ggaaaagttc atagggtctc ttcaggttct tagtgtcact tacctagatt 4620
tacagcctca cttgaatgtg tcactactca cagtctcttt aatcttcagt tttatcttta 4680
atctcctctt ttatcttgga ctgacattta gcgtagctaa gtgaaaaggt catagctgag 4740
attcctggtt cgggtgttac gcacacgtac ttaaatgaaa gcatgtggca tgttcatcgt 4800
ataacacaat atgaatacag ggcatgcatt ttgcagcagt gagtctcttc agaaaaccct 4860
tttctacagt tagggttgag ttacttccta tcaagccagt acgtgctaac aggctcaata 4920
ttcctgaatg aaatatcaga ctagtgacaa gctcctggtc ttgagatgtc ttctcgttaa 4980
ggagtagggc cttttggagg taaaggtata 5010



<210> 326
<211> 2574 
<212> DNA 

<213> Homo sapiens 



<400> 326 

cctgtttaga cacatggaca acaatcccag cgctacaagg cacacagtcc gcttcttcgt 60
cctcagggtt gccagcgctt cctggaagtc ctgaagctct cgcagtgcag tgagttcatg 120
caccttcttg ccaagcctca gtctttggga tctggggagg ccgcctggtt ttcctccctc 180
cttctgcacg tctgctgggg tctcttcctc tccaggcctt gccgtccccc tggcctctct 240
tcccagctca cacatgaaga tgcacttgca aagggctctg gtggtcctgg ccctgctgaa 300
ctttgccacg gtcagcctct ctctgtccac ttgcaccacc ttggacttcg gccacatcaa 360
gaagaagagg gtggaagcca ttaggggaca gatcttgagc aagctcaggc tcaccagccc 420
ccctgagcca acggtgatga cccacgtccc ctatcaggtc ctggcccttt acaacagcac 480
ccgggagctg ctggaggaga tgcatgggga gagggaggaa ggctgcaccc aggaaaacac 540
cgagtcggaa tactatgcca aagaaatcca taaattcgac atgatccagg ggctggcgga 600
gcacaacgaa ctggctgtct gccctaaagg aattacctcc aaggttttcc gcttcaatgt 660
gtcctcagtg gagaaaaata gaaccaacct attccgagca gaattccggg tcttgcgggt 720
gcccaacccc agctctaagc ggaatgagca gaggatcgag ctcttccaga tccttcggcc 780
agatgagcac attgccaaac agcgctatat cggtggcaag aatctgccca cacggggcac 840
tgccgagtgg ctgtcctttg atgtcactga cactgtgcgt gagtggctgt tgagaagaga 900
gtccaactta ggtctagaaa tcagcattca ctgtccatgt cacacctttc agcccaatgg 960






agatatcctg gaaaacattc acgaggtgat ggaaatcaaa ttcaaaggcg tggacaatga 1020
ggatgaccat ggccgtggag atctggggcg cctcaagaag cagaaggatc accacaaccc 1080
tcatctaatc ctcatgatga ttcccccaca ccggctcgac aacccgggcc aggggggtca 1140
gaggaagaag cgggctttgg acaccaatta ctgcttccgc aacttggagg agaactgctg 1200
tgtgcgcccc ctctacattg acttccgaca ggatctgggc tggaagtggg tccatgaacc 1260
taagggctac tatgccaact tctgctcagg cccttgccca tacctccgca gtgcagacac 1320
aacccacagc acggtgctgg gactgtacaa cactctgaac cctgaagcat ctgcctcgcc 1380
ttgctgcgtg ccccaggacc tggagcccct gaccatcctg tactatgttg ggaggacccc 1440
caaagtggag cagctctcca acatggtggt gaagtcttgt aaatgtagct gagaccccac 1500
gtgcgacaga gagaggggag agagaaccac cactgcctga ctgcccgctc ctcgggaaac 1560
acacaagcaa caaacctcac tgagaggcct ggagcccaca accttcggct ccgggcaaat 1620
ggctgagatg gaggtttcct tttggaacat ttctttcttg ctggctctga gaatcacggt 1680
ggtaaagaaa gtgtgggttt ggttagagga aggctgaact cttcagaaca cacagacttt 1740
ctgtgacgca gacagagggg atggggatag aggaaaggga tggtaagttg agatgttgtg 1800
tggcaatggg atttgggcta ccctaaaggg agaaggaagg gcagagaatg gctgggtcag 1860
ggccagactg gaagacactt cagatctgag gttggatttg ctcattgctg taccacatct 1920
gctctaggga atctggatta tgttatacaa ggcaagcatt ttttttttta aagacaggtt 1980
acgaagacaa agtcccagaa ttgtatctca tactgtctgg gattaagggc aaatctatta 2040
cttttgcaaa ctgtcctcta catcaattaa catcgtgggt cactacaggg agaaaatcca 2100
ggtcatgcag ttcctggccc atcaactgta ttgggccttt tggatatgct gaacgcagaa 2160
gaaagggtgg aaatcaaccc tctcctgtct gccctctggg tccctcctct cacctctccc 2220
tcgatcatat ttccccttgg acacttggtt agacgccttc caggtcagga tgcacatttc 2280
tggattgtgg ttccatgcag ccttggggca ttatgggtct tcccccactt cccctccaag 2340
accctgtgtt catttggtgt tcctggaagc aggtgctaca acatgtgagg catrtcgggga 2400
agctgcacat gtgccacaca gtgacttggc cccagacgca tagactgagg tataaagaca 2460
agtatgaata ttactctcaa aatctttgta taaataaata tttttggggc atcctggatg 2520
atttcatctt ctggaatatt gtttctagaa cagtaaaagc cttattctaa ggtg 2574



<210> 327 
<211> 1421
<212> DNA 

<213> Homo sapiens 



<400> 327 

acttactgcg ggacggcctt ggagagtact cgggttcgtg aacttcccgg aggcgcaatg 60
agctgcatta acctgcccac tgtgctgccc ggctccccca gcaagacccg ggggcagatc 120
caggtgattc tcgggccgat gttctcagga aaaagcacag agttgatgag acgcgtccgt 180
cgcttccaga ttgctcagta caagtgcctg gtgatcaagt atgccaaaga cactcgctac 240
agcagcagct tctgcacaca tgaccggaac accatggagg cgctgcccgc ctgcctgctc 300
cgagacgtgg cccaggaggc cctgggcgtg gctgtcatag gcatcgacga ggggcagttt 360
ttccctgaca tcatggagtt ctgcgaggcc atggccaacg ccgggaagac cgtaattgtg 420
gctgcactgg atgggacctt ccagaggaag ccatttgggg ccatcctgaa cctggtgccg 480
ctggccgaga gcgtggtgaa gctgacggcg gtgtgcatgg agtgcttccg ggaagccgcc 540
tataccaaga ggctcggcac agagaaggag gtcgaggtga ttgggggagc agacaagtac 600
cactccgtgt gtcggctctg ctacttcaag aaggcctcag gccagcctgc cgggccggac 660
aacaaagaga actgcccagt gccaggaaag ccaggggaag ccgtggctgc caggaagctc 720
tttgccccac agcagattct gcaatgcagc cctgccaact gagggacctg caagggccgc 780
ccgctccctt cctgccactg ccgcctactg gacgctgccc tgcatgctgc ccagccactc 840
caggaggaag tcgggaggcg tggagggtga ccacaccttg gccttctggg aactctcctt 900
tgtgtggctg ccccacctgc cgcatgctcc ctcctctcct acccactggt ctgcttaaag 960
cttccctctc agctgctggg acgatcgccc aggctggagc tggccccgct tggtggcctg 1020
ggatctggca cactccctct ccttggggtg agggacagag ccccacgctg ttgacatcag 1080
cctgcttctt cccctctgcg gctttcactg ctgagtttct gttctccctg ggaagcctgt 1140
gccagcacct ttgagccttg gcccacactg aggcttaggc ctctctgcct gggatgggct 1200
cccaccctcc cctgaggatg gcctggattc acgccctctt gtttcctttt gggctcaaag 1260
cccttcctac ctctggtgat ggtttccaca ggaacaacag catctttcac caagatgggt 1320
ggcaccaacc ttgctgggac ttggatccca ggggcttatc tcttcaagtg tggagagggc 1380
agggtccacg cctctgctgt agcttatgaa attaactaat t 1421



<210> 328 
<211> 4604 
<212> DNA 

<213> Homo sapiens 
<400> 328 

ggaacagctt gtccacccgc cggccggacc agaagccttt gggtctgaag tgtctgtgag 60 
acctcacaga agagcacccc tgggctccac ttacctgccc cctgctcctt cagggatgga 120 
ggcaatggcg gccagcactt ccctgcctga ccctggagac tttgaccgga acgtgccccg 180 
gatctgtggg gtgtgtggag accgagccac tggctttcac ttcaatgcta tgacctgtga 240
aggctgcaaa ggcttcttca ggcgaagcat gaagcggaag gcactattca cctgcccctt 300
caacggggac tgccgcatca ccaaggacaa ccgacgccac tgccaggcct gccggctcaa 360
acgctgtgtg gacatcggca tgatgaagga gttcattctg acagatgagg aagtgcagag 420
gaagcgggag atgatcctga agcggaagga ggaggaggcc ttgaaggaca gtctgcggcc 480
caagctgtct gaggagcagc agcgcatcat tgccatactg ctggacgccc accataagac 540
ctacgacccc acctactccg acttctgcca gttccggcct ccagttcgtg tgaatgatgg 600
tggagggagc catccttcca ggcccaactc cagacacact cccagcttct ctggggactc 660
ctcctcctcc tgctcagatc actgtatcac ctcttcagac atgatggact cgtccagctt 720
ctccaatctg gatctgagtg aagaagattc agatgaccct tctgtgaccc tagagctgtc 780
ccagctctcc atgctgcccc acctggctga cctggtcagt tacagcatcc aaaaggtcat 840
tggctttgct aagatgatac caggattcag agacctcacc tctgaggacc agatcgtact 900
gctgaagtca agtgccattg aggtcatcat gttgcgctcc aatgagtcct tcaccatgga 960
cgacatgtcc tggacctgtg gcaaccaaga ctacaagtac cgcgtcagtg acgtgaccaa 1020
agccggacac agcctggagc tgattgagcc cctcatcaag ttccaggtgg gactgaagaa 1080
gctgaacttg catgaggagg agcatgtcct gctcatggcc atctgcatcg tctccccaga 1140
tcgtcctggg gtgcaggacg ccgcgctgat tgaggccatc caggaccgcc tgtccaacac 1200
actgcagacg tacatccgct gccgccaccc gcccccgggc agccacctgc tctatgccaa 1260
gatgatccag aagctagccg acctgcgcag cctcaatgag gagcactcca agcagtaccg 1320
ctgcctctcc ttccagcctg agtgcagcat gaagctaacg ccccttgtgc tcgaagtgtt 1380
tggcaatgag atctcctgac taggacagcc tgtgcggtgc ctgggtgggg ctgctcctcc 1440
agggccacgt gccaggcccg gggctggcgg ctactcagca gccctcctca cccgtctggg 1500
gttcagcccc tcctctgcca cctcccctat ccacccagcc cattctctct cctgtccaac 1560
ctaacccctt tcctgcgggc ttttccccgg tcccttgaga cctcagccat gaggagttgc 1620
tgtttgtttg acaaagaaac ccaagtgggg gcagagggca gaggctggag gcaggccttg 1680
cccagagatg cctccaccgc tgcctaagtg gctgctgact gatgttgagg gaacagacag 1740
gagaaatgca tccattcctc agggacagag acacctgcac ctccccccac tgcaggcccc 1800
gcttgtccag cgcctagtgg ggtctccctc tcctgcctta ctcacgataa ataatcggcc 1860
cacagctccc accccacccc cttcagtgcc caccaacatc ccattgccct ggttatattc 1920
tcacgggcag tagctgtggt gaggtgggtt ttcttcccat cactggagca ccaggcacga 1980
acccacctgc tgagagaccc aaggaggaaa aacagacaaa aacagcctca cagaagaata 2040
tgacagctgt ccctgtcacc aagctcacag ttcctcgccc tgggtctaag gggttggttg 2100
aggtggaagc cctccttcca cggatccatg tagcaggact gaattgtccc cagtttgcag 2160
aaaagcacct gccgacctcg tcctccccct gccagtgcct tacctcctgc ccaggagagc 2220
cagccctccc tgtcctcctc ggatcaccga gagtagccga gagcctgctc ccccaccccc 2280
tccccagggg agagggtctg gagaagcagt gagccgcatc ttctccatct ggcagggtgg 2340
gatggaggag aagaattttc agaccccagc ggctgagtca tgatctccct gccgcctcaa 2400
tgtggttgca aggccgctgt tcaccacagg gctaagagct aggctgccgc accccagagt 2460
gtgggaaggg agagcggggc agtctcgggt ggctagtcag agagagtgtt tgggggttcc 2520
gtgatgtagg gtaaggtgcc ttcttattct cactccacca cccaaaagtc aaaaggtgcc 2580
tgtgaggcag gggcggagtg atacaacttc aagtgcatgc tctctgcagg tcgagcccag 2640
cccagctggt gggaagcgtc tgtccgttta ctccaaggtg ggtctttgtg agagtgagct 2700
gtaggtgtgc gggaccggta cagaaaggcg ttcttcgagg tggatcacag aggcttcttc 2760
agatcaatgc ttgagtttgg aatcggccgc attccctgag tcaccaggaa tgttaaagtc 2820
agtgggaacg tgactgcccc aactcctgga agctgtgtcc ttgcacctgc atccgtagtt 2880
ccctgaaaac ccagagagga atcagacttc acactgcaag agccttggtg tccacctggc 2940
cccatgtctc tcagaattct tcaggtggaa aaacatctga aagccacgtt ccttactgca 3000
gaatagcata tatatcgctt aatcttaaat ttattagata tgagttgttt tcagactcag 3060
actccatttg tattatagtc taatatacag ggtagcaggt accactgatt tggagatatt 3120
tatgggggga gaacttacat tgtgaaactt ctgtacatta attattattg ctgttgttat 3180
tttacaaggg tctagggaga gacccttgtt tgattttagc tgcagaactg tattggtcca 3240
gcttgctctt cagtgggaga aaaacacttg taagttgcta aacgagtcaa tcccctcatt 3300
caggaaaact gacagaggag ggcgtgactc acccaagcca tatataacta gctagaagtg 3360
ggccaggaca ggccgggcgc ggtggctcac gcctgtaatc ccagcagttt gggaggtcga 3420
ggtaggtgga tcacctgagg tcgggagttc gagaccaacc tgaccaacat ggagaaaccc 3480
tgtctctatt aaaaatacaa aaaaaaaaaa aaaaaaaaat agccgggcat ggtggcgcaa 3540
gcctgtaatc ccagctactc aggaggctga ggcagaagaa ttgaacccag gaggtggagg 3600
ttgcagtgag ctgagatcgt gccgttactc tccaacctgg acaacaagag cgaaactccg 3660
tcttagaagt ggaccaggac aggaccagat tttggagtca tggtccggtg tccttttcac 3720
tacaccatgt ttgagctcag acccccactc tcattcccca ggtggctgac ccagtccctg 3780
ggggaagccc tggatttcag aaagagccaa gtctggatct gggacccttt ccttccttcc 3840
ctggcttgta actccaccaa gcccatcaga aggagaagga aggagactca cctctgcctc 3900
aatgtgaatc agaccctacc ccaccacgat gtgccctggc tgctgggctc tccacctcag 3960
gccttggata atgctgttgc ctcatctata acatgcattt gtctttgtaa tgtcaccacc 4020
ttcccagctc tccctctggc cctgcttctt cggggaactc ctgaaatatc agttactcag 4080
ccctgggccc caccacctag gccactcctc caaaggaagt ctaggagctg ggaggaaaag 4140
aaaagagggg aaaatgagtt tttatggggc tgaacgggga gaaaaggtca tcatcgattc 4200
tactttagaa tgagagtgtg aaatagacat ttgtaaatgt aaaactttta aggtatatca 4260
ttataactga aggagaaggt gccccaaaat gcaagatttt ccacaagatt cccagagaca 4320

ggaaaatcct ctggctggct aactggaagc atgtaggaga atccaagcga ggtcaacaga 4380 

gaaggcagga atgtgtggca gatttagtga aagctagaga tatggcagcg aaaggatgta 4440 

aacagtgcct gctgaatgat ttccaaagag aaaaaaagtt tgccagaagt ttgtcaagtc 4500 

aaccaatgta gaaagctttg cttatggtaa taaaaatggc tcatacttat atagcactta 4560 

ctttgtttgc aagtactgct gtaaataaat gctttatgca aacc 4604

<210> 329 
<211> 2076 
<212> DNA 

<213> Homo sapiens 
<400> 329 

cggggaaggg gagggaggag ggggacgagg gctctggcgg gtttggaggg gctgaacatc 60 
gcggggtgtt ctggtgtccc ccgccccgcc tctccaaaaa gctacaccga cgcggaccgc 120 
ggcggcgtcc tccctcgccc tcgcttcacc tcgcgggctc cgaatgcggg gagctcggat 180 
gtccggtttc ctgtgaggct tttacctgac acccgccgcc tttccccggc actggctggg 240 
agggcgccct gcaaagttgg gaacgcggag ccccggaccc gctcccgccg cctccggctc 300 
gcccaggggg ggtcgccggg aggagcccgg gggagaggga ccaggagggg cccgcggcct 360 
cgcaggggcg cccgcgcccc cacccctgcc cccgccagcg gaccggtccc ccacccccgg 420
tccttccacc atgcacttgc tgggcttctt ctctgtggcg tgttctctgc tcgccgctgc 480 
gctgctcccg ggtcctcgcg aggcgcccgc cgccgccgcc gccttcgagt ccggactcga 540 
cctctcggac gcggagcccg acgcgggcga ggccacggct tatgcaagca aagatctgga 600 
ggagcagtta cggtctgtgt ccagtgtaga tgaactcatg actgtactct acccagaata 660 
ttggaaaatg tacaagtgtc agctaaggaa aggaggctgg caacataaca gagaacaggc 720 
caacctcaac tcaaggacag aagagactat aaaatttgct gcagcacatt ataatacaga 780 
gatcttgaaa agtattgata atgagtggag aaagactcaa tgcatgccac gggaggtgtg 840 
tatagatgtg gggaaggagt ttggagtcgc gacaaacacc ttctttaaac ctccatgtgt 900 
gtccgtctac agatgtgggg gttgctgcaa tagtgagggg ctgcagtgca tgaacaccag 960 
cacgagctac ctcagcaaga cgttatttga aattacagtg cctctctctc aaggccccaa 1020 
accagtaaca atcagttttg ccaatcacac ttcctgccga tgcatgtcta aactggatgt 1080 
ttacagacaa gttcattcca ttattagacg ttccctgcca gcaacactac cacagtgtca 1140 
ggcagcgaac aagacctgcc ccaccaatta catgtggaat aatcacatct gcagatgcct 1200 
ggctcaggaa gattttatgt tttcctcgga tgctggagat gactcaacag atggattcca 1260 
tgacatctgt ggaccaaaca aggagctgga tgaagagacc tgtcagtgtg tctgcagagc 1320 
ggggcttcgg cctgccagct gtggacccca caaagaacta gacagaaact catgccagtg 1380
tgtctgtaaa aacaaactct tccccagcca atgtggggcc aaccgagaat ttgatgaaaa 1440 
cacatgccag tgtgtatgta aaagaacctg ccccagaaat caacccctaa atcctggaaa 1500 
atgtgcctgt gaatgtacag aaagtccaca gaaatgcttg ttaaaaggaa agaagttcca 1560 
ccaccaaaca tgcagctgtt acagacggcc atgtacgaac cgccagaagg cttgtgagcc 1620 
aggattttca tatagtgaag aagtgtgtcg ttgtgtccct tcatattgga aaagaccaca 1680 
aatgagctaa gattgtactg ttttccagtt catcgatttt ctattatgga aaactgtgtt 1740 
gccacagtag aactgtctgt gaacagagag acccttgtgg gtccatgcta acaaagacaa 1800 
aagtctgtct ttcctgaacc atgtggataa ctttacagaa atggactgga gctcatctgc 1860 
aaaaggcctc ttgtaaagac tggttttctg ccaatgacca aacagccaag attttcctct 1920 
tgtgatttct ttaaaagaat gactatataa tttatttcca ctaaaaatat tgtttctgca 1980 
ttcattttta tagcaacaac aattggtaaa actcactgtg atcaatattt ttatatcatg 2040 
caaaatatgt ttaaaataaa atgaaaattg tattat 2076 

<210> 330 
<211> 2819 
<212> DNA 

<213> Homo sapiens 
<400> 330 

ctgggcccag ctcccccgag aggtggtcgg atcctctggg ctgctcggtc gatgcctgtg 60 

ccactgacgt ccaggcatga ggtggttcct gccctggacg ctggcagcag tgacagcagc 120 

agccgccagc accgtcctgg ccacggccct ctctccagcc cctacgacca tggactttac 180 

cccagctcca ctggaggaca cctcctcacg cccccaattc tgcaagtggc catgtgagtg 240 

cccgccatcc ccaccccgct gcccgctggg ggtcagcctc atcacagatg gctgtgagtg 300 

ctgtaagatg tgcgctcagc agcttgggga caactgcacg gaggctgcca tctgtgaccc 360 

ccaccggggc ctctactgtg actacagcgg ggaccgcccg aggtacgcaa taggagtgtg 420 

tgcacaggtg gtcggtgtgg gctgcgtcct ggatggggtg cgctacaaca acggccagtc 480 

cttccagcct aactgcaagt acaactgcac gtgcatcgac ggcgcggtgg gctgcacacc 540 

actgtgcctc cgagtgcgcc ccccgcgtct ctggtgcccc cacccgcggc gcgtgagcat 600 

acctggccac tgctgtgagc agtgggtatg tgaggacgac gccaagaggc cacgcaagac 660 

cgcaccccgt gacacaggag ccttcgatgc tgtgggtgag gtggaggcat ggcacaggaa 720 

ctgcatagcc tacacaagcc cctggagccc ttgctccacc agctgcggcc tgggggtctc 780 
cactcggatc tccaatgtta acgcccagtg ctggcctgag caagagagcc gcctctgcaa 840
cttgcggcca tgcgatgtgg acatccatac actcattaag gcagggaaga agtgtctggc 900
tgtgtaccag ccagaggcat ccatgaactt cacacttgcg ggctgcatca gcacacgctc 960
ctatcaaccc aagtactgtg gagtttgcat ggacaatagg tgctgcatcc cctacaagtc 1020
taagactatc gacgtgtcct tccagtgtcc tgatgggctt ggcttctccc gccaggtcct 1080
atggattaat gcctgcttct gtaacctgag ctgtaggaat cccaatgaca tctttgctga 1140
cttggaatcc taccctgact tctcagaaat tgccaactag gcaggcacaa atcttgggtc 1200
ttggggacta acccaatgcc tgtgaagcag tcagccctta tggccaataa cttttcacca 1260
atgagcctta gttaccctga tctggaccct tggcctccat ttctgtctct aaccattcaa 1320
atgacgcctg atggtgctgc tcaggcccat gctatgagtt ttctccttga tatcattcag 1380
catctactct aaagaaaaat gcctgtctct agctgttctg gactacaccc aagcctgatc 1440
cagcctttcc aagtcactag aagtcctgct ggatcttgcc taaatcccaa gaaatggaat 1500
caggtagact tttaatatca ctaatttctt ctttagatgc caaaccacaa gactctttgg 1560
gtccattcag atgaatagat ggaatttgga acaatagaat aatctattat ttggagcctg 1620
ccaagaggta ctgtaatggg taattctgac gtcagcgcac caaaactatc ctgattccaa 1680
atatgtatgc acctcaaggt catcaaacat ttgccaagtg agttgaatag ttgcttaatt 1740
ttgattttta atggaaagtt gtatccatta acctgggcat tgttgaggtt aagtttctct 1800
tcacccctac actgtgaagg gtacagatta ggtttgtccc agtcagaaat aaaatttgat 1860
aaacattcct gttgatggga aaagccccca gttaatactc cagagacagg gaaaggtcag 1920
cccgtttcag aaggaccaat tgactctcac actgaatcag ctgctgactg gcagggcttt 1980
gggcagttgg ccaggctctt ccttgaatct tctcccttgt cctgcttggg gttcatagga 2040
attggtaagg cctctggact ggcctgtctg gcccctgaga gtggtgccct ggaacactcc 2100
tctactctta cagagccttg agagacccag ctgcagacca tgccagaccc actgaaatga 2160
ccaagacagg ttcaggtagg ggtgtgggtc aaaccaagaa gtgggtgccc ttggtagcag 2220
cctggggtga cctctagagc tggaggctgt gggactccag gggcccccgt gttcaggaca 2280
catctattgc agagactcat ttcacagcct ttcgttctgc tgaccaaatg gccagttttc 2340
tggtaggaag atggaggttt accggttgtt tagaaacaga aatagactta ataaaggttt 2400
aaagctgaag aggttgaagc taaaaggaaa aggttgttgt taatgaatat caggctatta 2460
tttattgtat taggaaaata taatatttac tgttagaatt cttttattta gggccttttc 2520
tgtgccagac attgctctca gtgctttgca tgtattagct cactgaatct tcacgacaat 2580
gttgagaagt tcccattatt atttctgttc ttacaaatgt gaaacggaag ctcatagagg 2640
tgagaaaact caaccagagt cacccagttg gtgactggga aagttaggat tcagatcgaa 2700
attggactgt ctttataacc catattttcc ccctgttttt agagcttcca aatgtgtcag 2760
aataggaaaa cattgcaata aatggcttga ttttttaaaa aaaaaaaaaa aaaaaaaaa 2819



<210> 331 
<211> 2540 
<212> DNA 

<213> Homo sapiens 



<400> 331 

gaaaaggtgg acaagtccta ttttcaagag aagatgactt ttaacagttt tgaaggatct 60
aaaacttgtg tacctgcaga catcaataag gaagaagaat ttgtagaaga gtttaataga 120
ttaaaaactt ttgctaattt tccaagtggt agtcctgttt cagcatcaac actggcacga 180
gcagggtttc tttatactgg tgaaggagat accgtgcggt gctttagttg tcatgcagct 240
gtagatagat ggcaatatgg agactcagca gttggaagac acaggaaagt atccccaaat 300
tgcagattta tcaacggctt ttatcttgaa aatagtgcca cgcagtctac aaattctggt 360
atccagaatg gtcagtacaa agttgaaaac tatctgggaa gcagagatca ttttgcctta 420
gacaggccat ctgagacaca tgcagactat cttttgagaa ctgggcaggt tgtagatata 480
tcagacacca tatacccgag gaaccctgcc atgtattgtg aagaagctag attaaagtcc 540
tttcagaact ggccagacta tgctcaccta accccaagag agttagcaag tgctggactc 600
tactacacag gtattggtga ccaagtgcag tgcttttgtt gtggtggaaa actgaaaaat 660
tgggaacctt gtgatcgtgc ctggtcagaa cacaggcgac actttcctaa ttgcttcttt 720
gttttgggcc ggaatcttaa tattcgaagt gaatctgatg ctgtgagttc tgataggaat 780
ttcccaaatt caacaaatct tccaagaaat ccatccatgg cagattatga agcacggatc 840
tttacttttg ggacatggat atactcagtt aacaaggagc agcttgcaag agctggattt 900
tatgctttag gtgaaggtga taaagtaaag tgctttcact gtggaggagg gctaactgat 960
tggaagccca gtgaagaccc ttgggaacaa catgctaaat ggtatccagg gtgcaaatat 1020
ctgttagaac agaagggaca agaatatata aacaatattc atttaactca ttcacttgag 1080
gagtgtctgg taagaactac tgagaaaaca ccatcactaa ctagaagaat tgatgatacc 1140
atcttccaaa atcctatggt acaagaagct atacgaatgg ggttcagttt caaggacatt 1200
aagaaaataa tggaggaaaa aattcagata tctgggagca actataaatc acttgaggtt 1260
ctggttgcag atctagtgaa tgctcagaaa gacagtatgc aagatgagtc aagtcagact 1320
tcattacaga aagagattag tactgaagag cagctaaggc gcctgcaaga ggagaagctt 1380
tgcaaaatct gtatggatag aaatattgct atcgtttttg ttccttgtgg acatctagtc 1440
acttgtaaac aatgtgctga agcagttgac aagtgtccca tgtgctacac agtcattact 1500
ttcaagcaaa aaatttttat gtcttaatct aactctatag taggcatgtt atgttgttct 1560
tattaccctg attgaatgtg tgatgtgaac tgactttaag taatcaggat tgaattccat 1620
tagcatttgc taccaagtag gaaaaaaaat gtacatggca gtgttttagt tggcaatata 1680
atctttgaat ttcttgattt ttcagggtat tagctgtatt atccattttt tttactgtta 1740 
tttaattgaa accatagact aagaataaga agcatcatac tataactgaa cacaatgtgt 1800 
attcatagta tactgattta atttctaagt gtaagtgaat taatcatctg gattttttat 1860 
tcttttcaga taggcttaac aaatggagct ttctgtatat aaatgtggag attagagtta 1920 
atctccccaa tcacataatt tgttttgtgt gaaaaaggaa taaattgttc catgctggtg 1980 
gaaagataga gattgttttt agaggttggt tgttgtgttt taggattctg tccattttct 2040 
tgtaaaggga taaacacgga cgtgtgcgaa atatgtttgt aaagtgattt gccattgttg 2100 
aaagcgtatt taatgataga atactatcga gccaacatgt actgacatgg aaagatgtca 2160 
gagatatgtt aagtgtaaaa tgcaagtggc gggacactat gtatagtctg agccagatca 2220 
aagtatgtat gttgttaata tgcatagaac gagagatttg gaaagatata caccaaactg 2280 
ttaaatgtgg tttctcttcg gggagggggg gattggggga ggggccccag aggggtttta 2340 
gaggggcctt ttcactttcg acttttttca ttttgttctg ttcggatttt ttataagtat 2400 
gtagaccccg aagggtttta tgggaactaa catcagtaac ctaacccccg tgactatcct 2460 
gtgctcttcc tagggagctg tgttgtttcc cacccaccac ccttccctct gaacaaatgc 2520 
ctgagtgctg gggcactttg 2540 

<210> 332 

<211> 1474 

<212> DNA 

<213> Homo sapiens 

<400> 332 

aaaaagaaat caagaatgca attttattta caatagtcac gccggaaata cctagaaata 60 
aatttaactg aggatgtaaa agacctctac aaggagagtt caatgcgtag cgggagcgga 120 
gagctgaccc cagagagccc tgggcagccc cacctccgcc gccggcctag ttaccatcac 180 
accccggaga gcccgcagct gccgcagccg gccccagtca ccatcaccgc aaccatgagc 240
agcgaggccg agacccagca gccgcccgcc gccccccccg ccgcccccgc cctcagcgcc 300 
gccgacacca agcccggcac taccggagcg gcgcagggag cggtggcccg ggcggctcac 360 
atcggcggcg ctggcgcggg cgacaagaag gtcatcgcaa cgaaggtttt gggaacagta 420
aaatggttca atgtaaggaa cggatatggt ttcatcaaca ggaatgacac caaggaagat 480 
gtatttgtac accagactgc cataaagaag aataacccca ggaagtacct tcgcagtgta 540 
ggagatggag agactgtgga gtttgatgtt gttgaaggag aaaagggtgc ggaggcagca 600 
aatgttacag gtcctggtgg tgttccagtt caaggcagta aatatgcagc agaccgtaac 660 
cattatagac gctatccacg tcgtaggggt cctccacgca attaccagca aaattaccag 720 
aatagtgaga gtggggaaaa gaacgaggga tcggagagtg ctcccgaagc caggcccaac 780 
aacgccggcc ctacgcaggc gaaggttccc accttactac atgcggagac ctatgggcgt 840 
cgaccacagt attccaaccc tcctgtgcag ggagaagtga tggagggtgc tgacaaccag 900 
ggtgcaggag aacaaggtag accagtgagg cagatatgta tcggggatat agaccacgat 960 
tccgcagggg ccctcctcgc caaaagacag cctagagagg acggcaatga agaagataaa 1020 
gaaaatcaag gagatgagac ccaaggtcag cagccacctc aagctcggta ccgccgcaac 1080 
ttcaattacc gacgcagacg cccagaaaac cctaaaccac aagatggcaa agagacaaaa 1140 
gcagccgatc caccagctga gaattcgtcc gctcccgagg ctgagcaggg cggggctgag 1200 
taaatgccgg cttaccatct ctaccatcat ccggtttagt catccaacaa gaagaaatat 1260 
gaaattccag caataagaaa tgaacaaaag attggagctg aagacctaaa gtgcttgctt 1320 
tttgcccgtt gaccagataa atagaactat ctgcattatc tatgcagcat ggggttttta 1380 
ttatgtttta cctaaagacg tctctttttg gtaataacaa accgtgtttt ttaaaaaagc 1440 
ctggtttttc tcaatacgcc tttaaaggaa ttcc 1474 

<210> 333 

<211> 4079 

<212> DNA 

<213> Homo sapiens 

<400> 333 

ggagcggcgg gcgggcggga gggctggcgg ggcgaacgtc tgggagacgt ctgaaagacc 60 
aacgagactt tggagaccag agacgcgcct ggggggacct ggggcttggg gcgtgcgaga 120 
tttcccttgc attcgctggg agctcgcgca gggatcgtcc catggccggg gctcggagcc 180 
gcgacccttg gggggcctcc gggatttgct acctttttgg ctccctgctc gtcgaactgc 240 
tcttctcacg ggctgtcgcc ttcaatctgg acgtgatggg tgccttgcgc aaggagggcg 300 
agccaggcag cctcttcggc ttctctgtgg ccctgcaccg gcagttgcag ccccgacccc 360 
agagctggct gctggtgggt gctccccagg ccctggctct tcctgggcag caggcgaatc 420 
gcactggagg cctcttcgct tgcccgttga gcctggagga gactgactgc tacagagtgg 480 
acatcgacca gggagctgat atgcaaaagg aaagcaagga gaaccagtgg ttgggagtca 540 
gtgttcggag ccaggggcct gggggcaaga ttgttacctg tgcacaccga tatgaggcaa 600 
ggcagcgagt ggaccagatc ctggagacgc gggatatgat tggtcgctgc tttgtgctca 660 
gccaggacct ggccatccgg gatgagttgg atggtgggga atggaagttc tgtgagggac 720 
gcccccaagg ccatgaacaa tttgggttct gccagcaggg cacagctgcc gccttctccc 780 



ctgatagcca ctacctcctc tttggggccc caggaaccta taattggaag gggttgcttt 840
ttgtgaccaa cattgatagc tcagaccccg accagctggt gtataaaact ttggaccctg 900
ctgaccggct cccaggacca gccggagact tggccctcaa tagctactta ggcttctcta 960
ttgactcggg gaaaggtctg gtgcgtgcag aagagctgag ctttgtggct ggagcccccc 1020
gcgccaacca caagggtgct gtggttatcc tgcgcaagga cagcgccagt cgcctggtgc 1080
ccgaggttat gctgtctggg gagcgcctga cctccggctt tggctactca ctggctgtgg 1140
ctgacctcaa cagtgatggc tggccagacc tgatagtggg tgccccctac ttctttgagc 1200
gccaagaaga gctggggggt gctgtgtatg tgtacttgaa ccaggggggt cactgggctg 1260
ggatctcccc tctccggctc tgcggctccc ctgactccat gttcgggatc agcctggctg 1320
tcctggggga cctcaaccaa gatggctttc cagatattgc agtgggtgcc ccctttgatg 1380
gtgatgggaa agtcttcatc taccatggga gcagcctggg ggttgtcgcc aaaccttcac 1440
aggtgctgga gggcgaggct gtgggcatca agagcttcgg ctactccctg tcaggcagct 1500
tggatatgga tgggaaccaa taccctgacc tgctggtggg ctccctggct gacaccgcag 1560
tgctcttcag ggccagaccc atcctccatg tctcccatga ggtctctatt gctccacgaa 1620
gcatcgacct ggagcagccc aactgtgctg gcggccactc ggtctgtgtg gacctaaggg 1680
tctgtttcag ctacattgca gtccccagca gctatagccc tactgtggcc ctggactatg 1740
tgttagatgc ggacacagac cggaggctcG ggggccaggt tccccgtgtg acgttcctga 1800
gccgtaacct ggaagaaccc aagcaccagg cctcgggcac cgtgtggctg aagcaccagc 1860
atgaccgagt ctgtggagac gccatgttcc agctccagga aaatgtcaaa gacaagcttc 1920
gggccattgt agtgaccttg tcctacagtc tccagaGccc tcggctccgg cgacaggctc 1980
ctggccaggg gctgcctcca gtggccccca tcctcaatgc ccaccagccc agcacccagc 2040
gggcagagat ccacttcctg aagcaaggct gtggtgaaga caagatctgc cagagcaatc 2100
tgcagctggt ccacgcccgc ttctgtaccc gggtcagcga cacggaattc caacctctgc 2160
ccatggatgt ggatggaaca acagccctgt ttgcactgag tgggcagcca gtcattggcc 2220
tggagctgat ggtcaccaac ctgccatcgg acccagccca gccccaggct gatggggatg 2280
atgcccatga agcccagctc ctggtcatgc ttcctgactc actgcactac tcaggggtcc 2340
gggccctgga ccctgcggag aagccactct gcctgtccaa tgagaatgcc tcccatgttg 2400
agtgtgagct ggggaacccc atgaagagag gtgcccaggt caccttctac ctcatcctta 2460
gcacctccgg gatcagcatt gagaccacgg aactggaggt agagctgctg ttggccacga 2520
tcagtgagca ggagctgcat ccagtctctg cacgagcccg tgtcttcatt gagctgccac 2580
tgtccattgc aggaatggcc attccccagc aactcttctt ctctggtgtg gtgaggggcg 2640
agagagccat gcagtctgag cgggatgtgg gcagcaaggt caagtatgag gtcacggttt 2700
ccaaccaagg ccagtcgctc agaaccctgg gctctgcctt cctcaacatc atgtggcctc 2760
atgagattgc caatgggaag tggttgctgt acccaatgca ggttgagctg gagggcgggc 2820
aggggcctgg gcagaaaggg ctttgctctc ccaggcccaa catcctccac ctggatgtgg 2880
acagtaggga taggaggcgg cgggagctgg agccacctga gcagcaggag cctggtgagc 2940
ggcaggagcc cagcatgtcc tggtggccag tgtcctctgc tgagaagaag aaaaacatca 3000
ccctggactg cgcccggggc acggccaact gtgtggtgtt cagctgccca ctctacagct 3060
ttgaccgcgc ggctgtgctg catgtctggg gccgtctctg gaacagcacc tttctggagg 3120
agtactcagc tgtgaagtcc ctggaagtga ttgtccgggc caacatcaca gtgaagtcct 3180
ccataaagaa cttgatgctc cgagatgcct ccacagtgat cccagtgatg gtatacttgg 3240
accccatggc tgtggtggca gaaggagtgc cctggtgggt catcctcctg gctgtactgg 3300
ctgggctgct ggtgctagca ctgctggtgc tgctcctgtg gaagatggga ttcttcaaac 3360
gggcgaagca ccccgaggcc accgtgcccc agtaccatgc ggtgaagatt cctcgggaag 3420
accgacagca gttcaaggag gagaagacgg gcaccatcct gaggaacaac tggggcagcc 3480
cccggcggga gggcccggat gcacacccca tcctggctgc tgacgggcat cccgagctgg 3540
gccccgatgg gcatccaggg ccaggcaccg cctaggttcc catgtcccag cctggcctgt 3600
ggctgccctc catcccttcc ccagagatgg ctccttggga tgaagagggt agagtgggct 3660
gctggtgtcg catcaagatt tggcaggatc ggcttcctca ggggcacaga cctctcccac 3720
ccacaagaac tcctcccacc caacttcccc ttagagtgct gtgagatgag agtgggtaaa 3780
tcagggacag ggccatgggg tagggtgaga agggcagggg tgtcctgatg caaaggtggg 3840
gagaagggat cctaatccct tcctctccca ttcaccctgt gtaacaggac cccaaggacc 3900
tgcctccccg gaagtgcctt aacctagagg gtcggggagg aggttgtgtc actgactcag 3960
gctgctcctt ctctagtttc ccctctcatc tgaccttagt ttgctgccat cagtctagtg 4020
gtttcgtggt ttcgtctatt tattaaaaaa tatttgagaa caaaaaaaaa aaaaaaaaa 4079



<210> 334 
<211> 3373
<212> DNA 

<213> Homo sapiens 



<400> 334 

ggtggcaact 

cctcccccgc 

agatagagat 

aaaagcccac 

cgcagctcgc 

gaccgacgga 



tctcctcctg 
tgcctcccta 
actttgcgcg 
cctccagcct 
agcccgcagc 
ctcgcgccgc 



cggccgggag 
gggctcccct 
cacacacata 
cgctgcaaag 
ccgcagagga 
gtccacctgt 



cggcctgcct gcctccctgc 
ccggccgcca gcgcccattt 
catacgcgcg caaaaaggaa 
agaaaaccgg agcagccgca 
cgcccagagc ggcgagcagg 
cggccgggcc cagccgagcg 
gcacccgcag 60
ttcattccct 120 
aaaaaaaaaa 180 
gctcgcagct 240 
cgggcagacg 300 
cgcagcgggc 360 






acgccgcgcg 

tcccgccccc 

ctgccgttgc 

cccagctccg 

ccggcgggca 

ttccttttcc 

ctggttcgcg 

gtagccggcc 

ccggggccgg 

gcgcctgatc 

cgcgggggcg 

gcatgaatcg 

ccgaggggga 

cctttgatga 

tggacctgaa 

gaaggagcct 

gcaccgaggt 

tgtggccgcc 

agtgccgccc 

ggaagaagcc 

gtgagacagt 

gagccaaaac 

agggcaagca 

gagcctaggg 

tattgccccc 

gcctgattcg 

atcagcgggt 

aggactgaac 

agagagactg 

ggccacacct 

gcctgatccc 

cctcggactc 

ggtgcccaaa 

gttttcgttt 

ggtgccctct 

tgggtatccc 

cctctggttc 

agtctgctcc 

tcctctctag 

acccagccac 

gcagggagca 

tggcagatgt 

ggactggctg 

ggtaggggca 

tccagcaact 

tgcagccctc 

agctccacct 

aagggaaaag 

ctactgcact 

tgattatatt 

tatttaaagt 



cgcggagcag 

ctacccggcc 

aaagccaact 

cgctttccga 

gaggaaaacg 

tcttggaggt 

acgcagcccc 

tcggacccag 

gggcggcggc 

gccgcggacc 

gcgcgctcga 

ctgctgggcg 

ccccattccc 

tctccaacgc 

catgacccgc 

gggttccctg 

gttcgagatc 

ctgtgtggag 

cacccaggtg 

aatctttaag 

ggcagctgca 

gccccaaact 

ccggaaattc 

gcatcggcag 

atggggtcct 

gacggccaat 

ctcctcccag 

tccatcgcca 

atggggtcgc 

gagcgctgtg 

tgaacccctg 

catggctaag 

caccagtcac 

tgaagacgtg 

caggtgggtt 

ctgcctcctg 

ctttcatctc 

accaaggggc 

ggcagaagac 

gactgccatg 

ctgttcacat

aggtaaagag 

tgcccctgat 

gggaggcaac 

gcccttccag 

ttgcccgcac 

ctggtggctc 

atccccaaga 

ttcccccttc 

tttgggggct 

tgt 



ccgtgcccgc cgcccgggcc cgccgccagg 
cgggcgggag tttgcacctc tccctgcccg 
ttggaaaaag ttttttgggg gagacttggg 
ttttgggggc ctttccagaa aatgttgcaa 
cctgtagccg gcgagtgaag acgaaccatc 
tggagtcccc tgggcgcccc cacacggcta 
ccggccgtgg atgctgcact cgggctcggg 
gtcctgcgcc caggtcctcc cctgcccccc 
gccgggggca tgcgggtgag ccgcggctgc 
tgagccgagc ccacccccct ccccagcccc 
tctacgcgtc cggggccccg cggggccggg 
ctcttcctgt ctctctgctg ctacctgcgt
gaggagcttt atgagatgct gagtgaccac 
ctgctgcacg gagaccccgg agaggaagat 
tcccactctg gaggcgagct ggagagcttg 
accattgctg agccggccat gatcgccgag 
tcccggcgcc tcatagaccg caccaacgcc
gtgcagcgct gctccggctg ctgcaacaac 
cagctgtgac ctgtccaggt gagaaagatc 
aaggccacgg tgacgctgga agaccacctg 
cggcctgtga cccgaagccc ggggggttcc 
cgggtgacca ttcggacggt gcgagtccgc 
aagcacacgc atgacaagac ggcactgaag 
gagagtgtgt gggcagggtt atttaatatg 
tggagtgata atattgtttc cctcgtccgt 
ggtgcttccc ccacccctcc acgtgtccgt 
cggcctccgg tcttgcccag cagctcaaag
tcttcttccc ttaactccaa gaacttggga 
tctttggggg aaacgggttc cttcccctgc 
gactgtcctg aggagccctg aggacctctc 
gccagctctg aggggaggca cctccaggca
accacagacg ggcacacaga ctggagaaaa 
ctcgtctccc tggtgcctct gtgcacagtg 
gactcctctt ggtgggtgtg gccagcacac 
agagatggag tttgctgttg aggtggtgta 
ccaccccttc ctccccatac tccactctga 
tctacctcca ccctgcattt tcctcttgtc 
tcttgaaccc cttattaagg ccccagatga 
tagaggccag ggcagcaagg gacctgctca 
taaggttgtg cagggtgtgt actgcacaag 
catagataaa gctgatttgt atatttatta 
gaaaaggatc cttttcctaa ttcacacaaa 
gcagcctgtg gctggagt^g ccaaatagga 
actgctgtcc acatgacctc catttcccaa 
gtgggtgtgg gacacctggg agaaggtctc 
ccctccctgc ttgcacactt ccccatcttt 
ctcctaggaa accagctcgt gggctgggaa 
ccccctgggg tgggatctga gctcccacct 
ccgccttcca aaacctgctt ccttcagttt 
ttccttttat tttttaaatg taaaatttat 



gcgcacacgc 420 
ggtgctcgag 480 
ccttgaggtg 540 
aaaagctaag 600 
gactgccgtg 660 
gacgcctcgg 720 
atccgcccag 780 
agcgacggag 840 
agaggcctga 900
ccaccctggc 960 
cccggagtcg 1020 
ctggtcagcg 1080 
tcgatccgct 1140 
ggggccgagt 1200 
gctcgtggaa 1260 
tgcaagacgc 1320 
aacttcctgg 1380 
cgcaacgtgc 1440 
gagattgtgc 1500 
gcatgcaagt 1560 
caggagcagc 1620 
cggcccccca 1680 
gagacccttg 1740 
gtatttgctg 1800 
ctgtctcgat 1860 
ccacccttcc 1920 
aagaaaaaga 1980 
taagagtgtg 2040 
acctggcctg 2100 
agcatagcct 2160 
ggccaggctg 2220 
cccctcccac 2280 
gcttcttttc 2340 
caagtggctg 2400 
gatggtgacc 2460 
ttcacctctt 2520 
ctggcccttc 2580 
ccccagtcac 2640 
tcatattcca 2700 
gacattgtat 2760 
tgacaatttc 2820 
gactccttgt 2880 
gggagactgt 2940 
agtcctctgc 3000 
caagggaggg 3060 
gatccttctg 3120 
tgggggagag 3180 
cccttcccac 3240 
gtaaagtcgg 3300 
ttatattccg 3360 
3373 



<210> 335 
<211> 2304 
<212> DNA 

<213> Homo sapiens 



<400> 335 

gtccccgcag 

ccccgacctc 

gagcgactcc 

tggaggaaca 

gaaattcgga 

tcacttttac 

ctctgccact 

cctggggaaa 

gcaggtgggc 

aaagccctcc 



cgccgtcgcg 
gccaccatga 
aaaggcagca 
tgtgtgtcca 
gggcagcact 
cgaggaaagg 
gtccttcagc 
cataattact 
ctaaagccgc 
tctcctccag 



ccctcctgcc 
gagccctgct 
atgaacttca 
acaagtactt 
gtgaaataga 
ccagcactga 
aaacgtacca 
gcaggaaccc 
ttgtccaaga 
aagaattaaa 



gcaggccacc gaggccgccg 
ggcgcgcctg cttctctgcg 
tcaagttcca tcgaactgtg 
ctccaacatt cactggtgca 
taagtcaaaa acctgctatg 
caccatgggc cggccctgcc 
tgcccacaga tctgatgctc 
agacaaccgg aggcgaccct 
gtgcatggtg catgactgcg 
atttcagtgt ggccaaaaga 
ccgtctagcg 60
tcctggtcgt 120 
actgtctaaa 180 
actgcccaaa 240 
aggggaatgg 300 
tgccctggaa 360 
ttcagctggg 420 
ggtgctatgt 480 
cagatggaaa 540 
ctctgaggcc 600 






ccgctttaag 
catctacagg 
cccttgctgg 
catcgtctac 
ggtggaaaac 
cattgccttg 
acagaccatc 
cactggcttt 
tgttgtgaag 
caccaccaaa 
ctcaggggga 
ctggggccgt 
cttaccctgg 
agggaggaaa 
tccatcagct 
caccaccagg 
cagaccctct 
tgtctttttc 
ggctcgaagg 
aatgaataat 
aatgtgggag 
attccatgaa 
gctgtgagtg 
aaactgtgtg 
ctggggcctc 
acctgtgacc 
atcccttcct
acactgaata 
atcaataaaa 



attattgggg 
aggcaccggg 
gtgatcagcg 
ctgggtcgct 
ctcatcctac 
ctgaagatcc 
tgcctgccct 
ggaaaagaga 
ctgatttccc 
atgctatgtg 
cccctcgtct 
ggatgtgccc 
atccgcagtc 
cgggcaccac 
gtaagaagag 
gtgaacgaca 
ggccaggatg 
tggactgaag 
gagagccagc 
ttcccaatta 
cagcggtttg 
tgtatcagga 
taagtgtgag 
gactgtgatg 
ttgggtcccc 
agcactgtct
tttagcctag 
tttatatttc 
tgtgattttt 



gagaattcac caccatcgag aaccagccct 
dqqgctctgt cacctacgtg tgtggaggca 
ccacacactg cttcattgat tacccaaaga 
caaggcttaa ctccaacacg caaggggaga
acaaggacta cagcgctgac acgcttgctc 
gttccaagga gggcaggtgt g^gcagccat 
cgatgtataa cgatccccag tttggcacaa 
attctaccga ctatctctat ccggagcagc 
accgggagtg tcagcagccc cactactacg 
ctgctgaccc ccaatggaaa acagattcct 
gttccctcca aggccgcatg actttgactg
tgaaggacaa gccaggcgtc tacacgagag 
acaccaagga agagaatggc ctggccctcr 
ccgctttctt gctggttgtc atttttgcag 
actgggaaga taggctctgc acagatggat 
atagctttac cctcacggat aggectgggi: 
gaggggtggt cctgactcaa catgttactg
cctgcaggag ttaaaaaggg cagggcatct 
tcccccgacc ggtgggcatt tgtgaggccc 
ggaagtgtaa gcagctgagg tctcttgagg 
gggagcagag acactaacga cttcagggca
aatatatatg tgtgtgtatg "tgcacact 
taagagctgg tgtctgattg ttaagtctaa 
ccacacagag tggtctttct ggagaggtta
cacgtgacag tgcctgggaa tgtacttatt 
cagtttcact ttcacataga tgtccctttc 
ttcatccaat cctcactggg tggggtgagg 
actattttta tttatatttt tgtaatttta 

ctga 



ggtttgcggc 660 
gcctcatcag 720 
aggaggacta 780 
tgaagtttga 840 
accacaacga 900 
cccggactat 960 
gctgtgagat 1020 
tgaaaatgac 1080 
gctctgaagt 1140 
gccagggaga 1200 
gaattgtgag 1260 
tctcacactt 1320 
gagggtcccc 1380 
tagagtcatc 1440 
ttgcctgtgg 1500 
gctggctgcc 1560 
accagcaact 1620 
cctgtgcatg 1680 
atggttgaga 1740 
gagcttagcc 1800 
gggctctgat 1860 
tgttgngtgg 1920 
atatttcctt 1980 
taggtcactc 2040 
ctgcagcatg 2100 
ttggccagtt 2160 
accactcctt 2220 
aataaaagtg 2280 
2304 



<210> 336 
<211> 1876 
<212> DNA 

<213> Homo sapiens 



<400> 336 
cgcggccgcg 
tggcccatag 
aacccaagga 
tgaatagtga 
catggatcag 
catgactccc 
acagcctatt 
gcaacaacaa 
gcagcagcag 
agccgttcag 
gctcttccac 
ctcccccatg 
gattgtaccg 
aaagaccatt 
aatcatgagg 
gtgcacagga 
tgtacagaag 
gagctgtgat 
tagtagttat 
tgttctcctt 
aatttatgaa 
atggctctca 
gtttgttttg 
gtggcaccag 
tttgtgcact 
gggcagcgct 
tgagggagaa 
aatgtttttc 
taaaagtgtt 
cacctctcca 
tttttttctt 
aaaaaaaaaa 



gttcgctgtg 
tgatctttgc 
attgaggaag 
gacgagttcc 
aacaacagcc 
ggaatcccta 
cagaacacca 
cagcagcagc 
cagcagcagc 
cagtcaacgt 
tcacagactc 
actcccatga 
cagctgcaaa 
gcacttcgtg 
ataagagagc 
gccaagagtg 
ttgggttttc 
gtgaagtttc 
gagccagagt 
atttttgttt 
gcatttgaaa 
tgtacccttg 
gtacctttaa 
gtgatgccct 
gagaacaccg 
gcccatttat 
aactttaagt 
cccatgaacc 
gtttttctaa 
gtattgcagg 
tcttcagagt 
aaaaaa 



gcgggcgcct 
agtgacccag 
ttgctgagaa 
agcgcaaggg 
tgccacctta 
tctttagtcc 
atagtctgtc 
agcagcagca 
agcagcagca 
cccagcaggc 
tcacaactgc 
cccccatcac 
atattgtatc
cccgaaacgc 
cacgaaccac 
aagaacagtc 
cagctaagtt 
ctataaggtt 
tatttcctgg 
ctggaaaagt 
acatctaccc 
cctcccccac 
atggtggtgt 
tctgtaagtg 
cgcagcgtga 
ttatatgtag 
gttaaagcca 
acagttttta 
tttataactc 
acggaatata 
actctgtaca 



gggccgccgg 
cagcatcact 
gagtgtgctg 
tttctggttt 
cgctcagggc 
aatgatgcct 
tattttggaa 
gcagcagcaa 
gcagcagcag 
aacacaggga 
acccttgccg 
tcctgccacg 
cacagtgaat 
cgaatataat 
ggcactgatt 
cagactggca 
cttggacttc 
agaaggcctt 
tttaatctac 
tgtattaaca 
tattctaaag 
ccccttcttt 
tgtgagaaga 
cccaccgcgg 
ctgtgagttg 
attttaaaca 
cctctataat 
tatttctacc 
ctaggggtta 
tgtgttaatg 
ataaatgcag 



ctgtttaact 
gtrtcttggc 
gagatgctct 
gccaagaaga 
ttggcctccc 
tatggcactg 
gagcaacaaa 
cagcaacagc 
caacaggcag 
acctcaggcc 
ggcaccactc 
ccagcttcgg 
cttggttgta 
cccaagcggt 
ttcagttctg
gcaagaaaat 
aagattcaga 
gtgctcaccc 
agaatgatca 
ggtgctaaag 
ggattcagga 
tttttttttt 
tggatgttga 
gatgccggga 
ctcataccgt 
ctgctgttga 
tgattggact 
agaaaagtaa 
tttctgtgcc 
aaaatgaatg 
tttataaaag 



tcgcttccgc 60 
gtgtgaagat 120 
aggaaaaaat 180 
aagtgaacat 240 
ctcagggtgc 300 
gactgacccc 360 
ggcagcagca 420 
agcagcagca 480 
tggcagctgc 540 
aggcaccaca 600 
cactgtatcc 660 
agagttctgg 720 
aacttgacct 780 
ttgctgcggt 840 
ggaaaatggt 900 
atgctagagt 960 
acatggtggg 1020 
accaacaatt 1080 
aacccagaat 1140 
tcagagcaga 1200 
agacgacgta 1260 
aaacaaatca 1320 
gttgcagggt 1380 
aggggcatta 1440 
gctgctatct 1500 
caagttggtt 1560 
ttttaatttt 1620 
aaatcttttt 1680 
agacacattc 1740 
gctgtacata 1800 
tgttaaaaaa 1860 
1876 

<210> 337 
<211> 6633 
<212> DNA
<213> Homo sapiens

<400> 337
ttctccccgc cccccagttg ttgtcgaagt ctgggggttg ggactggacc ccctgattgc 60 
gtaagagcaa aaagcgaagg cgcaatctgg acactgggag attcggagcg cagggagttt 120 
gagagaaact tttattttga agagaccaag gttgaggggg ggcttatttc ctgacagcta 180 
tttacttaga gcaaatgatt agttttagaa ggatggacta taacattgaa tcaattacaa 240 
aacgcggttt ttgagcccat tactgttgga gctacaggga gagaaacagg aggagactgc 300 
aagagatcat ttgggaaggc cgtgggcacg ctctttactc catgtgtggg acattcattg 360 
cggaataaca tcggaggaga agtttcccag agctatgggg acttcccatc cggcgttcct 420 
ggtcttaggc tgtcttctca cagggctgag cctaatcctc tgccagcttt cattaccctc 480 
tatccttcca aatgaaaatg aaaaggttgt gcagctgaat tcatcctttt ctctgagatg 540 
ctttggggag agtgaagtga gctggcagta ccccatgtct gaagaagaga gctccgatgt 600 
ggaaatcaga aatgaagaaa acaacagcgg cctttttgtg acggtcttgg aagtgagcag 660 
tgcctcggcg gcccacacag ggttgtacac ttgctattac aaccacactc agacagaaga 720 
gaatgagctt gaaggcaggc acatttacat ctatgtgcca gacccagatg tagcctttgt 780 
acctctagga atgacggatt atttagtcat cgtggaggat gatgattctg ccattatacc 840 
ttgtcgcaca actgatcccg agactcctgt aaccttacac aacagtgagg gggtggtacc 900 
tgcctcctac gacagcagac agggctttaa tgggaccttc actgtagggc cctatatctg 960 
tgaggccacc gtcaaaggaa agaagttcca gaccatccca tttaatgttt atgctttaaa 1020 
agcaacatca gagctggatc tagaaatgga agctcttaaa accgtgtata agtcagggga 1080 
aacgattgtg gtcacctgtg ctgtttttaa caatgaggtg gttgaccttc aatggactta 1140 
ccctggagaa gtgaaaggca aaggcatcac aatgctggaa gaaatcaaag tcccatccat 1200 
caaattggtg tacactttga cggtccccga ggccacggtg aaagacagtg gagattacga 1260 
atgtgctgcc cgccaggcta ccagggaggt caaagaaatg aagaaagtca ctatttctgt 1320 
ccatgagaaa ggtttcattg aaatcaaacc caccttcagc cagttggaag ctgtcaacct 1380 
gcatgaagtc aaacattttg ttgtagaggt gcgggcctac ccacctccca ggatatcctg 1440 
gctgaaaaac aatctgactc tgattgaaaa tctcactgag atcaccactg atgtggaaaa 1500 
gattcaggaa ataaggtatc gaagcaaatt aaagctgatc cgtgctaagg aagaagacag 1560 
tggccattat actattgtag ctcaaaatga agatgctgtg aagagctata cttttgaact 1620
gttaactcaa gttccttcat ccattctgga cttggtcgat gatcaccatg gctcaactgg 1680 
gggacagacg gtgaggtgca cagctgaagg cacgccgctt cctgatattg agtggatgat 1740 
atgcaaagat attaagaaat gtaataatga aacttcctgg actattttgg ccaacaatgt 1800 
ctcaaacatc atcacggaga tccactcccg agacaggagt accgtggagg gccgtgtgac 1860 
tttcgccaaa gtggaggaga ccatcgccgt gcgatgcctg gctaagaatc tccttggagc 1920 
tgagaaccga gagctgaagc tggtggctcc caccctgcgt tctgaactca cggtggctgc 1980 
tgcagtcctg gtgctgttgg tgattgtgat catctcactt attgtcctgg ttgtcatttg 2040 
gaaacagaaa ccgaggtatg aaattcgctg gagggtcatt gaatcaatca gcccggatgg 2100 
acatgaatat atttatgtgg acccgatgca gctgccttat gactcaagat gggagtttcc 2160 
aagagatgga ctagtgcttg gtcgggtctt ggggtctgga gcgtttggga aggtggttga 2220 
aggaacagcc tatggattaa gccggtccca acctgtcatg aaagttgcag tgaagatgct 2280 
aaaacccacg gccagatcca gtgaaaaaca agctctcatg tctgaactga agataatgac 2340 
tcacctgggg ccacatttga acattgtaaa cttgctggga gcctgcacca agtcaggccc 2400 
catttacatc atcacagagt attgcttcta tggagatttg gtcaactatt tgcataagaa 2460 
tagggatagc ttcctgagcc accacccaga gaagccaaag aaagagctgg atatctttgg 2520 
attgaaccct gctgatgaaa gcacacggag ctatgttatt ttatcttttg aaaacaatgg 2580 
tgactacatg gacatgaagc aggctgatac tacacagtat gtccccatgc tagaaaggaa 2640 
agaggtttct aaatattccg acatccagag atcactctat gatcgtccag cctcatataa 2700 
gaagaaatct atgttagact cagaagtcaa aaacctcctt tcagatgata actcagaagg 2760 
ccttacttta ttggatttgt tgagcttcac ctatcaagtt gcccgaggaa tggagttttt 2820
ggcttcaaaa aattgtgtcc accgtgatct ggctgctcgc aacgtcctcc tggcacaagg 2880 
aaaaattgtg aagatctgtg actttggcct ggccagagac atcatgcatg attcgaacta 2940 
tgtgtcgaaa ggcagtacct ttctgcccgt gaagtggatg gctcctgaga gcatctttga 3000 
caacctctac accacactga gtgatgtctg gtcttatggc attctgctct gggagatctt 3060 
ttcccttggt ggcacccctt accccggcat gatggtggat tctactttct acaataagat 3120 
caagagtggg taccggatgg ccaagcctga ccacgctacc agtgaagtct acgagatcat 3180 
ggtgaaatgc tggaacagtg agccggagaa gagaccctcc ttttaccacc tgagtgagat 3240 
tgtggagaat ctgctgcctg gacaatataa aaagagttat gaaaaaattc acctggactt 3300 
cctgaagagt gaccatcctg ctgtggcacg catgcgtgtg gactcagaca atgcatacat 3360 
tggtgtcacc tacaaaaacg aggaagacaa gctgaaggac tgggagggtg gtctggatga 3420 
gcagagactg agcgctgaca gtggctacat cattcctctg cctgacattg accctgtccc 3480 
tgaggaggag gacctgggca agaggaacag acacagctcg cagacctctg aagagagtgc 3540 
cattgagacg ggttccagca gttccacctt catcaagaga gaggacgaga ccattgaaga 3600 
catcgacatg atggacgaca tcggcataga ctcttcagac ctggtggaag acagcttcct 3660 
gtaactggcg
aaaaccactt 
gttcccagcc 
gaactttgtc 
tggagataga 
attggtgaga 
gtaaataact 
aagagaccac 
aactttttaa 
gcattttgct 
aatagatttg 
aatactacta 
gagaaagcac 
atttatatta 
tagttgagca 
gtactgaata 
tgggattaga 
ctgtgctgaa 
agactggatt 
cccaatatat 
tcagcctcct 
agatattctt 
gtggcagcca 
tgagggaaac 
cggaaacact 
aacagggttg 
gtgacattta 
tgaggctgag 
cccactgtat 
ccccgccaag 
accagctcag 
tgggaggatg 
cattctttgc 
ctgcaatcac 
acttgactac 
atagtaagtg 
tgtagagcca 
gtttatcgct 
gctgtgagcc 
gtcctaacaa 
tttcagcaaa 
tgccaatctt 
atttgtgact 
aatggtccta 
tatatgtatt 
actgccttcg 
cttttgaatg 
ccaaaacaat 
aaatgtatta 
tatatttcaa 



gattcgaggg 
tattgcaatg 
aagggcctcg 
agtgttgcct 
tggataaggg 
gtccaacaga 
ctaaccaagg 
tcaatccatc 
agaagtgcat 
atctttttta 
ggtcatttag 
ctgttatcag 
aatttaaaac 
aataacatgt 
tagagaacaa 
ggttccccaa 
aacaaacaaa 
cataacttct 
tgcagaagtt 
gtattttttg 
tctttcaccc 
tagtggaggc 
ggatgactag 
cagagtctgt
gacttaggtt 
gaattcaacc 
atgccatcta 
aaagctaaag 
gggggagatt 
gaaaggcatg 
tgttttggtg 
aattgtcaca 
aatactgctt 
tgtgctatcg 
ctactggtgt 
cgaagactga 
attagacttg 
cactctccct 
ttgcatgaca 
atgctcccac 
ttccagattt 
tcctactttc 
ttttaaacga 
tttttgtgaa 
tctatataga 
tttatatttt 
tccaaaattt 
ggtgtggtga 
cgaatgcccc 
taaatgatat 



gttccttcca cttctggggc cacctctgga 
cggaggttga gaggaggact tggttgatgt 
gggagcgttc taaatatgaa tgaatgggat
ctcgcaatgc ctcagtagca tctcagtggt 
aataataggc cacagaaggt gaactttgtg 
cacaatttat actgcgacag aacttcagca 
ctgtgtttag attgtattaa ctatcttctt 
catgtacttc cctcttgaaa cctgatgtca 
gaaaaaccat ttttgaacct taaaaggtac 
gtgttaagag ataaagaata ataattaacc
aagcctgaca actcattttc atattgtaat 
taatgctaaa tgtgtaataa tgtaacatga 
aatccttact aagtaggtga tgagtttgac 
ttctctataa agtatggtaa tagctttagt
agtaaaagta gtgttgtcca ggaagtcaga
tccatcgtat taaaaaacaa ttaactgccc 
actcttaagt cctaaaagtt ctcaatgtag 
catgtatatt acccaatgga aaatataatg 
tttttttttt ttcttcatgc ctgatgaaag 
aatctatgaa cctgaaaagg gtcagaagga 
cttaccccaa agagaaagag tttgaaactc 
tggatgtgca ttagcctgga tcctcagttc
atcctgggtt tccatccttg agattctgaa 
atttttctaa actccctggc tgttctgatc 
tcaggaagtt gccatgggaa acaaataatt 
acgcaggaag cctactattt aaatccttgg
gctagciatt gcgaccttaa tttaacttnc
tttggttttg acaggttttc caaaagtaaa 
gaactttccc cgtctcccgt cttctgccrc
tacaaaaatt atgcaattca gtgttccaag 
gaaaaaacat tttaagtttt actgataatt 
tctatccaca ctgtcaaaca ggttggtgtg 
aattgctgat accatatgaa tgaaacatgg 
gcagatgatg ctttggaaga tgcagaagca 
aatctcaatg caagccccaa ctttcttatc 
gccagattgg ccaattaaaa acgaaaacct
aaatacgttt gtgtttctag aatcacagct 
tgtacagcct tattttgttg gtgctttgca 
tcatgaggcc ggatgaaact tctcagtcca 
ctgaatttgt atatgactgc atttgtgggt 
gtttcctttt ggcctcctgc aaagtctcca
tatttttatg atgacaatca aagccggcct 
ttagtgatgt ccttaaaatg tggtctgcca 
gagggacata agataaaatg atgttataca
cttggagaat actgccaaaa catttatgac 
tttaactgtg ataatcccca caggcacatt 
atattttaga aataataaaa agaaagatac 
atgtgtgaga aaaactaact tgatagggtc 
tgttcatgtt tttgttttaa aacgtgtaaa 
ataatttaaa gtt 



tcccgttcag 
ttaaagagaa 
attttgaaat 
gtgtgaagtt 
cttcaaggac 
ttgtaattat 
tggacttctg 
gctgctgttg 
tggtactata 
aaccttgttt 
ctatgtttat 
tttccctcca 
agtttttgac 
gaattaaatt 
atttttaact 
tctgaaataa 
aggcataaac 
atcagcaaaa
ctttggcaac 
tgcccagaca 
gagaccataa 
tcaaatgtgt 
gtatgaagtc 
ggccagtttt 
tgaactttgg 
cttcaggtta 
cagtcttagc 
gatgctactt 
ccactccata 
tctctgtgta 
tgaggttaga 
ggttcattgg 
gctgtgatta 
ataataaagt 
caactttttc 
gactaggttc 
caagcattct 
ttttgatatt 
gcagtttcca 
gtgtgtgtgt 
gaagaaaatt 
gagaaacact 
atctgtacaa 
tcaatatgta 
aagctgtatc 
aactgttgca 
rtacatgttc 
taccaataca 
tgaagatctt 



3720 
3780 
3840 
3900 
3960 
4020 
4080
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6633 



<210> 338 
<211> 994 
<212> DNA 
<213> Homo sapiens 



<400> 338 

tgctggccag 

aagcgcatga 

cagtgggagc 

gccagggtcc 

tggcggcagg 

atccagaaga 

tgcagctcag 

ccatttgaag 

gattccggca 



cacctcgagg 
gccgcagctc 
ggcccagcgg 
gctgctcgca 
agaagatcac 
tcaagtcggg 
ccaaggccag 
acgcctcgtt 
tccacatcat 



gaagatggcg gacgaggaga agctgccgcc
aggccgagtg tactacttca accacatcac 
caacagcagc agtggtggca aaaacgggca 
cctgctggtg aagcacagcc agtcacggcg 
ccggaccaag gaggaggccc tggagctgat 
agaggaggac tttgagtctc tggcctcaca
gggagacctg ggtgccttca gcagaggtca 
tgcgctgcgg acgggggaga tgagcgggcc 
cctccgcact gagtgagggt ggggagccca 
cggctgggag 60
taacgccagc 120 
gggggagcct 180 
gccctcgtcc 240 
caacggctac 300 
gttcagcgac 360 
gatgcagaag 420 
cgtgttcacg 480 
ggcctggcct 540 






cggggcaggg 
accccccact 
gcccttccag 
gaccgccaga 
gggcagtgtg 
tccccccagg 
tctgttcagt 
cagacccttt 



cagggcggct 
ccctgccacc 
attgggggcc 
ttctccctta 
gtgggagggg 
tgctggaggc 
cgcaaaggtg 
cacccccaat 



aggccggcca gctccccctt gcccgccagc 
gtcacacagt atttattgtt cccacaatgg 
ctggggtccc cactccctgt ccatccccag
aggaattgac ttcagcaggg gtgggaggct 
tgttccaaag agaaggcctg gtcagcagag 
agactcgagg gccgaattgt ttctagttag 
aacactcatg cggcagccat gggccctctg
taaacccaga acca 



cagtggccga 600 
ctgggagggg 660 
ttggggctgc 720 
cccagaccca 780 
ccgccccgtg 840 
gccacgctcc 900 
agcaactgtg 960 
994 



<210> 339 
<211> 772
<212> DNA 
<213> Homo sapiens 



<400> 339 

agctcgtgcc 

tgttaacccc 

gatcgagctc 

cggagaattc 

cataaaggat 

cagtatttac 

cctgctttcc 

ctctaiagtgc 

tctagtgatg 

tgtggtgatc 

ttcttcttgg 

gcctgctgct 

attgtcttta 



gaattcggca 
gtggtgttct 
tttgcagacg 
aggaaagatg 
ttcatgattc 
cgggggccat 
atggcgaaca 
gattggctgg 
agaaagattg 
tcgcagtgtg 
tggtgttctt 
gccccatttg 
agttttcaac 



cgagccgggt 
ttgatgtcag 
ttgtgcctaa 
gggttccaat 
agggtggaga 
ttgcagatga 
gtggtccaag 
atgggaagca 
agaatgttcc 
gggagatgta 
gagtaagata 
atcaagagac 
tgtaaataaa 



cggagccatg 
tattggcggt 
gacggccgag 
aggatacaaa 
ttttgttaat 
aaattttaaa 
tacaaatggc 
tgtggtgttt 
cacaggcccc 
gtccagacaa 
atctggactg 
catggaagtg 
gtttttttgt 



gcggtggcaa 
caggaagttg 
aactttaggc 
ggaagcacct 
ggagatggta 
cttagacact 
tgtcagttct 
ggaaaaatca 
aacaataagc 
agactgaatc 
gcccccgtct 
tcagagattc 
atgcgtaaaa 



attcaagtcc 
gccgcatgaa 
agttctgcac 
tccacagggt 
ctggagtcgc 
cagctccagg 
ttatcacctg 
tcgatggact 
ccaagctacc 
aggccttccc 
ttgcttccct 
agaatccaag 
aa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

772 



<210> 340 
<211> 919 
<212> DNA 

<213> Homo sapiens 



<400> 340 

cgctcgcctc 

cctcccggtg 

ggcgtgggca 

cgacaaggct 

ctttctgcag 

cacagccaag 

caccttcaca 

ctttctgcag 

cgtacagtac 

cgtgttttcc 

gcaggaggag 

tggcagatca 

tctgattagc 

cccatcttcc 

gtgaatctgc 

taaactatag 



cctcgctcca 
ggcggattcc 
gcggccgagc 
cgcttctctg 
gacaacatcg 
ggccgagtcc 
gacaccgagg 
aaaggaaatg 
tcctgccgcc 
cgggacccca 
ctgtgcctgg 
gaaagaaacc 
tctcagtctt 
ctcagttccc 
ttgcctttcc 
tcacccgtg 



cgcgcgcccg 
tgggcaagat 
gcgactgccg 
ggacctggta 
tcgcggagtt 
gtcttttgaa 
accctgccaa 
atgaccactg 
tcctgaacct 
acggcctgcc 
ccaggcagta 
ttttgtagca 
cagctctatt 
ataaaacctt 
tgaaagtttc 



gacgcggcgg 
gaagtgggtg 
agtgagcagc 
cgccatggcc 
ctcggtggac 
taactgggac 
gttcaagatg 
gatcgtcgac 
cgatggcacc 
cccagaagcg 
caggctgatc 
atatcaagaa 
tatcttagga 
cattacacat 
tggggcttaa 



ccaggcttgc 
tgggcgctct 
ttccgagtca 
aagaaggacc 
gagaccggcc 
gtgtgcgcag 
aagtactggg 
acagactacg 
tgtgctgaca 
cagaagattg 
gtccacaacg 
tctagtttca 
gtttaatttg 
aaagatacac 
gattccagac 



gcgtggttcc 
tgctgttggc 
aggagaactt 
ccgagggcct 
agatgagcgc 
acatggtggg 
gcgtagcctc 
acacgtatgc 
gctactcctt 
taaggcagcg 
gttactgcga 
tctgagaact 
cccttctctc 
gtgggggtca 
tctgattcat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

919 



<210> 341 
<211> 7365 
<212> DNA 

<213> Homo sapiens 



<400> 341 

ggcagtttgt 

gggggaatcc 

acgtcggctg 

gtctctacta 

tgaggcagga 

attgcactcc 

aacaaaaaac 

tgaagaagta 

ggagttgatc 

gctgaaactt 



aggtcgcgag ggaagcgctg aggatcagga agggggcact 
tcgtgatagg aactggaata tgccttgagg gggacactat 
gtcatgaggt caggagttcc agaccagcct gaccaacgtg 
aaaatacaaa aattagccgg gcgtggtgcc gctccagcta 
gaatcgctag aacccgggag gcggaggttg cagtgagccg 
agcctgggcg acagagcgag actgtctcaa aacaaaacaa 
accggctgtt cattggaaca gaaagaaatg gatttatctg 
caaaatgtca ttaatgctat gcagaaaatc ttagagtgtc 
aaggaacctg tctccacaaa gtgtgaccac atattttgca 
ctcaaccaga agaaagggcc ttcacagtgt cctttatgta 
gagtgtccgt 60
gtctttaaaa 120 
gtgaaactcc 180 
ctcaggaggc 240 
agatcgcgcc 300 
aacaaaacaa 360 
ctcttcgcgt 420 
ccatctgtct 480 
aattttgcat 540 
agaatgatat 600 
aaccaaaagg
aatcatttgt 
aaaaaaggaa 
gggctacaga 
ggaaaccagt 
gcagcggata 
agataccgtt 
ccctcaagga 
ttctgagacg 
cactgagaag 
cttgcatgtg 
cagtttatta 
caaacagcct 
taatgatagg 
tgagagaaaa 
agatgttcct 
aagtgatgaa 
agtagctgat 
aatagactta 
ctccaaatca 
ggcaagcctc 
tgagccacag 
tacatcaggc 
tcctgaaatg 
tactaatagt 
taacccaata
cagcagtata 
gaataggctg 
tagaaatcta 
agagataaag 
catggaaggt 
aagtaaaaga 
ttttactaag 
agaagaaaaa 
agatctcatg 
tatttcattg 
tagcactcta 
tgaaaacccc 
ctttaagtat 
agaaagtgaa 
atttgctccg 
ctctgggtcc 
aaatcaagga 
ctttcctgtg 
aggctctagg 
aaataaacat 
atttgttaaa 
gtcacctgaa 
taataacatt 
ttccagtact 
tcaagcagaa 
tttgcaacct 
aaaaaagcaa 
gatttcagat 
gacacctgat 
tgacattaag 
gagtcctagc 
attagagtcc 
cttgttattt 
taccgagtgt 
tgactgcagt 
aacaaaatgt 
aaatacaaac 
tgaaagccag 
aacgggcttg 
agcatctggg 
gagtgacatt 
gcaggaaatg 



agcctacaag 
gcttttcagc 
aataactctc 
aaccgtgcca 
ctcagtgtcc 
caacctcaaa 
aataaggcaa 
accagggatg 
gatgtaacaa 
cgtgcagctg 
gagccatgtg 
ctcactaaag 
ggcttagcaa 
cggactccca 
gaatggaata 
tggataacac 
ctgttaggtt 
gtattggacg 
ctggccagtg 
gtagagagta 
cccaacttaa 
ataatacaag 
cttcatcctg 
ataaatcagg 
ggtcatgaga 
gaatcactcg 
agcaatatgg 
aggaggaagt 
agcccaccta 
aaaaaaaagt 
aaagaacctg 
catgacagcg 
tgttcaaata 
gaagagaaac 
ttaagtggag 
gtacctggta 
gggaaggcaa 
aagggactaa 
ccattgggac 
cttgatgctc 
ttttcaaatc 
ttaaagaaac 
aagaatgagt 
gttggtcaga 
ttttgtctat 
ggacttttac 
actaaatgta 
agagaaatgg 
agagaaaatg 
aatgaagtgg 
ctaggtagaa 
gaggtctata 
gaatatgaag 
aacttagaac 
gacctgttag 
gaaagttctg 
cctttcaccc 
tcagaagaga 
ggtaaagtaa 
ctgtctaaga 
aaccaggtaa 
tctgctagct 
acccaggatc 
ggagttggtc 
gaagaaaata 
tgtgagagtg 
ttaaccactc 
gctgaactag 



aaagtacgag atttagtcaa cttgttgaag 
ttgacacagg tttggagtat gcaaacagct 
ctgaacatct aaaagatgaa gtttctatca 
aaagacttct acagagtgaa cccgaaaatc 
aactctctaa ccttggaact gtgagaactc 
agacgtctgt ctacattgaa ttgggatctg 
cttattgcag tgtgggagat caagaattgt 
aaatcagttt ggattctgca aaaaaggctg 
atactgaaca tcatcaaccc agtaataatg 
agaggcatcc agaaaagtat cagggtagtt 
gcacaaatac tcatgccagc tcattacagc 
acagaatgaa tgtagaaaag gctgaattct 
ggagccaaca taacagatgg gctggaagta 
gcacagaaaa aaaggtagat ctgaatgctg 
agcagaaact gccatgctca gagaatccta
taaatagcag cattcagaaa gttaatgagt 
ctgatgactc acatgatggg gagtctgaat 
ttctaaatga ggtagatgaa tattctggtt 
atcctcatga ggctttaata tgtaaaagtg 
atattgaaga caaaatattt gggaaaacct 
gccatgtaac tgaaaatcta attataggag 
agcgtcccct cacaaataaa ttaaagcgta 
aggattttat caagaaagca gatttggcag 
gaactaacca aacggagcag aatggtcaag 
ataaaacaaa aggtgattct attcagaatg 
aaaaagaatc tgctttcaaa acgaaagctg 
aactcgaatt aaatatccac aattcaaaag 
cttctaccag gcatattcat gcgcttgaac 
attgtactga attgcaaatt gatagttgtt 
acaaccaaat gccagtcagg cacagcagaa 
caactggagc caagaagagt aacaagccaa 
atactttccc agagctgaag ttaacaaatg 
ccagtgaact taaagaattt gtcaatccta 
tagaaacagt taaagtgtct aataatgctg 
aaagggtttt gcaaactgaa agatctgtag 
ctgattatgg cactcaggaa agtatctcgt 
aaacagaacc aaataaatgt gtgagtcagt 
ttcatggttg ttccaaagat aatagaaatg 
atgaagttaa ccacagtcgg gaaacaagca 
agtatttgca gaatacattc aaggtttcaa 
caggaaatgc agaagaggaa tgtgcaacat 
aaagtccaaa agtcactttt gaatgtgaac 
ctaatatcaa gcctgtacag acagttaata 
aagataagcc agttgataat gccaaatgta 
catctcagtt cagaggcaac gaaactggac 
aaaacccata tcgtatacca ccactttttc 
agaaaaatct gctagaggaa aactttgagg 
gaaatgagaa cattccaagt acagtgagca 
tttttaaaga agccagctca agcaatatta 
gctccagtat taatgaaata ggttccagtg 
acagagggcc aaaattgaat gctatgctta 
aacaaagtct tcctggaagt aattgtaagc 
aagtagttca gactgttaiat acagatttct 
agcctatggg aagtagtcat gcatctcagg 
atgatggtga aataaaggaa gatactagtt 
ctgtttttag caaaagcgtc cagaaaggag 
atacacattt ggctcagggt taccgaagag 
acttatctag tgaggatgaa gagcttccct 
acaatatacc ttctcagtct actaggcata 
acacagagga gaatttatta tcattgaaga 
tattggcaaa ggcatctcag gaacatcacc 
tgttttcttc acagtgcagt gaattggaag 
ctttcttgat tggttcttcc aaacaaatga 
tgagtgacaa ggaattggtt tcagatgatg 
atcaagaaga gcaaagcatg gattcaaact 
aaacaagcgt ctctgaagac tgctcagggc 
agcagaggga taccatgcaa cataacctga 
aagctgtgtt agaacagcat gggagccagc 
agctattgaa
ataattttgc 
tccaaagtat 
cttccttgca 
tgaggacaaa 
attcttctga 
tacaaatcac 
cttgtgaatt 
atttrgaacac 
ctgtttcaaa 
atgagaacag 
gtaataaaag 
aggaaacatg 
atcccctgtg 
gagatactga 
ggttttccag 
caaatgccaa 
cttcagagaa 
aaagagttca 
atcggaagaa 
catntgttac 
aaaggagacc 
ttcaaaagac 
tgatgaatat 
agaaaaatcc 
aacctataag 
cacctaaaaa 
tagtagtcag 
ctagcagtga 
acctacaact 
atgaacagac 
cacctggttc 
gccttccaag 
aagaccccaa 
agagtagcag 
tactggaagt 
gtgcagcatt 
acacagaagg 
tagaaatgga 
agcgccagtc 
tctctgccca 
aaaaggaaga 
tcactgcagg 
gtatcaaagg 
tcattactcc 
ccatcaagtc 
aacattcaat 
caattagccg 
atgaagtagg 
atgaaaacat 
gattaggggt 
atcctgaaat 
ctccatatct 
tttgttctga 
ttgctgaaaa 
agcttagcag 
gggccaagaa 
gcttccaaca 
gcaccgttgc 
atagcttaaa 
ttagtgagga 
acttgactgc 
ggcatcagtc 
aagaaagagg 
taggtgaagc 
tatcctctca 
taaagctcca 
cttctaacag 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
ctacccttcc atcataagtg actcttctgc ccttgaggac ctgcgaaatc cagaacaaag 4740
cacatcagaa aaagcagtat taacttcaca gaaaagtagt gaatacccta taagccagaa 4800 
tccagaaggc ctttctgctg acaagtttga ggtgtctgca gatagttcta ccagtaaaaa 4860 
taaagaacca ggagtggaaa ggtcatcccc ttctaaatgc ccatcattag atgataggtg 4920 
gtacatgcac agttgctctg ggagtcttca gaatagaaac tacccatctc aagaggagct 4980 
cattaaggtt gttgatgtgg aggagcaaca gctggaagag tctgggccac acgatttgac 5040 
ggaaacatct tacttgccaa ggcaagatct agagggaacc ccttacctgg aatctggaat 5100 
cagcctcttc tctgatgacc ctgaatctga tccttctgaa gacagagccc cagagtcagc 5160 
tcgtgttggc aacataccat cttcaacctc tgcattgaaa gttccccaat tgaaagttgc 5220
agaatctgcc cagagtccag ctgctgctca tactactgat actgctgggt ataatgcaat 5280 
ggaagaaagt gtgagcaggg agaagccaga attgacagct tcaacagaaa gggtcaacaa 5340 
aagaatgtcc atggtggtgt ctggcctgac cccagaagaa tttatgctcg tgtacaagtt 5400 
tgccagaaaa caccacatca ctttaactaa tctaattact gaagagacta ctcatgttgt 5460 
tatgaaaaca gatgctgagt ttgtgtgtga acggacactg aaatattttc taggaattgc 5520 
gggaggaaaa tgggtagtta gctatttctg ggtgacccag tctattaaag aaagaaaaat 5580 
gctgaatgag catgattttg aagtcagagg agatgtggtc aatggaagaa accaccaagg 5640 
tccaaagcga gcaagagaat cccaggacag aaagatcttc agggggctag aaatctgttg 5700 
ctatgggccc ttcaccaaca tgcccacaga tcaactggaa tggatggtac agctgtgtgg 5760 
tgcttctgtg gtgaaggagc tttcatcatt cacccttggc acaggtgtcc acccaattgt 5820 
ggttgtgcag ccagatgcct ggacagagga caatggcttc catgcaattg ggcagatgtg 5880 
tgaggcacct gtggtgaccc gagagtgggt gttggacagt gtagcactct accagtgcca 5940 
ggagctggac acctacctga taccccagat cccccacagc cactactgac tgcagccagc 6000 
cacaggtaca gagccacagg accccaagaa tgagcttaca aagtggcctt tccaggccct 6060 
gggagctcct ctcactcttc agtccttcta ctgtcctggc tactaaatat tttatgtaca 6120 
tcagcctgaa aaggacttct ggctatgcaa gggtccctta aagattttct gcttgaagtc 6180 
tcccttggaa atctgccatg agcacaaaat tatggtaatt tttcacctga gaagatttta 6240 
aaaccattta aacgccacca attgagcaag atgctgattc attatttatc agccctattc 6300 
tttctattca ggctgttgtt ggcttagggc tggaagcaca gagtggcttg gcctcaagag 6360 
aatagctggt ttccctaagt ttacttctct aaaaccctgt gttcacaaag gcagagagtc 6420 
agacccttca atggaaggag agtgcttggg atcgattatg tgacttaaag tcagaatagt 6480 
ccttgggcag ttctcaaatg ttggagtgga acattgggga ggaaattctg aggcaggtat 6540 
tagaaatgaa aaggaaactt gaaacctggg catggtggct cacgcctgta atcccagcac 6600 
tttgggaggc caaggtgggc agatcactgg aggtcaggag ttcgaaacca gcctggccaa 6660 
catiggtgaaa ccccatctct actaaaaata cagaaattag ccggtcatgg tggtggacac 6720 
ctgtaatccc agctactcag gtggctaagg caggagaatc acttcagccc gggaggtgga 6780 
ggrtgcagtg agccaagatc ataccacggc actccagcct gggtgacagt gagactgtgg 6840 
ctcaaaaaaa aaaaaaaaaa aggaaaatga aactaggaaa ggtttcttaa agtctgagat 6900 
atatttgcta gatttctaaa gaatgtgttc taaaacagca gaagattttc aagaaccggt 6960 
ttccaaagac agtcttctaa ttcctcatta gtaataagta aaatgtttat tgttgtagct 7020 
ctggtatata atccattcct cttaaaatat aagacctctg gcatgaatat ttcatatcta 7080 
taaaatgaca gatcccacca ggaaggaagc tgttgctttc tttgaggtga tttttttcct 7140 
ttgctccctg ttgctgaaac catacagctt cataaataat tttgcttgct gaaggaagaa 7200 
aaagtgtttt tcataaaccc attatccagg actgtttata gctgttggaa ggactaggtc 7260 
ttccctagcc cccccagtgt gcaagggcag tgaagacttg attgtacaaa atacgttttg 7320 
taaatgttgt gctgttaaca ctgcaaataa acttggtagc aaaca 7365 



<210> 342 

<211> 10386 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (0)...(0) 

<223> n = a, t, c or g 

<400> 342 

attgaggact cggaaatgag gtccaagggt agccaaggat ggctgcagct tcatatgatc 60 

agttgttaaa gcaagttgag gcactgaaga tggagaactc aaatcttcga caagagctag 120 

aagataattc caatcatctt acaaaactgg aaactgaggc atctaatatg aaggaagtac 180 

ttaaacaact acaaggaagt attgaagatg aagctatggc ttcttctgga cagattgatt 240 

tattagagcg tcttaaagag cttaacttag atagcagtaa tttccctgga gtaaaactgc 300 

ggtcaaaaat gtccctccgt tcttatggaa gccgggaagg atctgtatca agccgttctg 360 

gagagtgcag tcctgttcct atgggttcat ttccaagaag agggtttgta aatggaagca 420 

gagaaagtac tggatattta gaagaacttg agaaagagag gtcattgctt cttgctgatc 480 

ttgacaaaga agaaaaggaa aaagacrggt attacgctca acttcagaat ctcactaaaa 540 
gaatagatag
aattggaata 
aggatatgga 
tacttcgtat 
acaagcatga 
aaatcaacat 
cagccagtgt 
tgggaaccaa 
atgatatgtc 
gacagtctgg 
tattgttggg 
acaacatcat 
atcttttgga 
aaccaggcat 
ctgctgtgtg 
aactaggggg 
ttactaatga 
tgacttttgg 
cacttgtggc 
tgaggaattt 
gtgtgaaagc 
tattgagtgc 
ctgtagatgg 
ctttagccat 
caaatgagga 
acttaaaatc 
cagcaagaaa 
agaacctcat 
atctcatggc 
gcttgccatc 
acttatcaga 
agcagagaca 
ataataggtc 
ctacagtgtt 
aagatagaag 
aaaatccagg 
ccaaagtcat 
ctaccactga 
cccatacaca 
ctatgcctta 
gtagtagtga 
aagatgatga 
tacatagtgc 
gtcttaaata 
gatgggcaag 
caaggaatca 
agttccaacc 
atggttcaga 
ctttgtgtca 
ctgaagaaga 
aagagaaacg 
cttcatcaca 
ccgaacatat 
agaatcagct 
cttgcaaagt 
caatatgttt 
gatgtaatca 
aagaaaagat 
agcaccctag 
ggcacaaagc 
cacccaaaag 
cttctgtcag 
aaccatgcag 
gacaaaccat 
ccaagcgaga 
agcaagctgc 
tattacattt 
ctctgagcct 



tcttccttta 
tgaagcaagg 
aaaacgagca 
acgacagctt 
aaccggctca 
ggcaacttct 
tttgagttct 
ggtggaaatg 
gcgaactttg 
atgtcttcct 
aaattcccgg 
tcactcacag 
acagatacgc 
ggaccaggac 
tgttctaatg 
actacaggcc 
ccactacagt 
agatgtagcc 
ccaactaaaa 
gtcttggcga 
attgatggaa 
cttatggaat 
tgcacttgca 
tattgaaagt 
ccacaggcaa 
tcatagtttg 
tcctaaagac 
tcattcaaag 
aaataggcct 
tcttcatgtt 
aacttttgac 
caagcaaagt 
agacaatttt 
acccagctcc 
tttggagaga 
aacttcttca 
ggaagaagtg 
attacattgt 
ttcaaacact 
tgccaaatta 
tggttatggt 
aagtaagttt 
aaatcatatg 
ttcagatgag 
acccaaacac 
aagtacaact 
acattttgga 
aacaaatcga 
agaagatgac 
acagcatgaa 
tcatgtggat 
gaaacagtca 
gtcttcaagc 
ccatccaagt 
ttcttctatt 
ttcaagatgt 
gacgacacag 
tggaactagg 
aaccaaatcc 
tgttgaattt 
tccacctgaa 
ttcacttgat 
tggaatggta 
gccaccaagc 
agtacctaaa 
agtaaatgct 
tgccacggaa 
cgatgagcca 
actgaaaatt tttccttaca aacagatatg
caaatcagag ttgcgatgga agaacaacta 
cagcgaagaa tagccagaat tcagcaaatc 
ttacagtccc aagcaacaga agcagagagg 
catgatgctg agcggcagaa tgaaggtcaa 
ggtaatggtc agggttcaac tacacgaatg 
agtagcacac actctgcacc tcgaaggctg 
gtgtattcat tgttgtcaat gcttggtact 
ctagctatgt ctagctccca agacagctgt 
ctcctcatcc agcttttaca tggcaatgac 
ggcagtaaag aggctcgggc cagggccagt 
cctgatgaca agagaggcag gcgtgaaatc 
gcttactgtg aaacctgttg ggagtggcag 
aaaaatccaa tgccagctcc tgttgaacat 
aaactttcat ttgatgaaga gcatagacat 
attgcagaat tattgcaagt ggactgtgaa 
attacactaa gacgatatgc tggaatggct 
aacaaggcta cgctatgctc tatgaaaggc 
tctgaaagtg aagacttaca gcaggttatt 
gcagatgtaa atagtaaaaa gacgttgcga 
tgtgctttag aagttaaaaa ggaatcaacc 
ttgtcagcac attgcactga gaataaagct 
tttttggttg gcactcttac ttaccggagc 
ggaggtggga tattacggaa tgtgtccagc 
atcctaagag agaacaactg tctacaaact 
acaatagtca gtaatgcatg tggaactttg 
caggaagcat tatgggacat gggggcagtt 
cacaaaatga ttgctatggg aagtgctgca 
gcgaagtaca aggatgccaa tattatgtct 
aggaaacaaa aagccctaga agcagaatta 
aatatagaca atttaagtcc caaggcatct 
ctctatggtg attatgtttt tgacaccaat 
aatactggca acatgactgt cctttcacca 
tcttcatcaa gaggaagctt agatagttct 
gaacgcggaa ttggtctagg caactaccat 
aagcgaggtt tgcagatctc caccactgca 
tcagccattc atacctctca ggaagacaga 
gtgacagatg agagaaatgc acttagaaga 
tacaatttca ctaagtcgga aaattcaaat 
gaatacaaga gatcttcaaa tgatagttta 
aaaagaggtc aaatgaaacc ctcgattgaa 
tgcagttatg gtcaataccc agccgaccta 
gatgataatg atggagaact agatacacca 
cagttgaact ctggaaggca aagtccttca 
ataatagaag atgaaataaa acaaagtgag 
tatcctgttt atactgagag cactgatgat 
cagcaggaat gtgtttctcc atacaggtca 
gtgggttcta atcatggaat taatcaaaat 
tatgaagatg ataagcctac caattatagt 
gaagaagaga gaccaacaaa ttatagcata 
cagcctattg attatagttt aaaatatgcc 
ttttcattct caaagagttc atctggacaa 
agtgagaata cgtccacacc ttcatctaat 
tctgcacaga gtagaagtgg tcagcctcaa 
aaccaagaaa caatacagac ttattgtgta 
agttcattat catctttgtc atcagctgaa 
gaagcagatt ctgctaatac cctgcaaata 
tcagctgaag atcctgtgag cgaagttcca 
agcagactgc agggttctag tttatcttca 
tcttcaggag cgaaatctcc ctccaaaagt 
cactatgttc aggagacccc actcatgttt 
agttttgaga gtcgttcgat tgccagctcc 
agtggcatta taagccccag tgatcttcca 
agaagtaaaa cacctccacc acctcctcaa 
aataaagcac ctactgctga aaagagagag 
gcagttcaga gggtccaggt tcttccagat 
agtactccag atggattttc ttgttcatcc 
tttatacaga aagatgtgga attaagaata 
accagaaggc
ggtacctgcc 
gaaaaggaca 
tcatctcaga 
ggagtgggag 
gaccatgaaa 
acaagtcatc 
catgataagg 
atatccatgc 
aaagactctg 
gcagcactcc 
cgagtccttc 
gaagctcatg 
cagatctgtc 
gcaatgaatg 
atgtaegggc 
ttgacaaact 
tgcatgagag 
gcaagtgttt 
gaagttggaa 
ctcaaaagcg 
gatatatgtg 
cagacaaaca 
ttgatagcta 
ttattacaac 
tggaatctct 
agcatgctca 
gctttaagga 
cctggctcaa 
gatgctcagc 
catcgtagta 
cgacatgatg 
tatttgaata 
cgttctgaaa 
ccagcaacag 
gcccagattg 
agttctgggt 
agctctgctg 
aggacatgtt 
aatagtgtca 
tcctattctg 
gcccataaaa 
ataaattata 
cagaatgaaa 
caaagacaat 
aaacacctca 
cggggagcca 
gtaagccagt 
gaacgttact 
aaatataatg 
acagatattc 
agcagtaaaa 
gccaagaggc 
aaggctgcca 
gaagatactc 
gatgaaatag 
gcagaaataa 
gcagtgtcac 
gaatcagcca 
ggtgctcaga 
agcagatgta 
gttcagagtg 
gatagccctg 
acagctcaaa 
agtggaccta 
gctgatactt 
agcctgagtg 
atgcctccag 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
ttcaggaaaa tgacaatggg aatgaaacag aatcagagca gcctaaagaa tcaaatgaaa 4680
accaagagaa agaggcagaa aaaactattg attctgaaaa ggacctatta gatgattcag 4740 
atgatgatga tattgaaata ctagaagaat gtattatttc tgccatgcca acaaagtcat 4800 
cacgtaaagc aaaaaagcca gcccagactg cttcaaaatt acctccacct gtggcaagga 4860 
aaccaagtca gctgcctgtg tacaaacttc taccatcaca aaacaggttg caaccccaaa 4920 
agcatgttag ttttacaccg ggggatgata tgccacgggt gtattgtgtt gaagggacac 4980 
ctataaactt ttccacagct acatctctaa gtgatctaac aatcgaatcc cctccaaatg 5040 
agttagctgc tggagaagga gttagaggag gagcacagtc aggtgaattt gaaaaacgag 5100 
ataccattcc tacagaaggc agaagtacag atgaggctca aggaggaaaa acctcatctg 5160 
taaccatacc tgaattggat gacaataaag cagaggaagg tgatattctt gcagaatgca 5220 
ttaattctgc tatgcccaaa gggaaaagtc acaagccttt ccgtgtgaaa aagataatgg 5280 
accaggtcca gcaagcatct gcgtcgtctt ctgcacccaa caaaaatcag ttagatggta 5340 
agaaaaagaa accaacttca ccagtaaaac ctataccaca aaatactgaa tataggacac 5400 
gtgtaagaaa aaatgcagac tcaaaaaata atttaaatgc tgagagagtt ttctcagaca 5460 
acaaagattc aaagaaacag aatttgaaaa ataattccaa ggacttcaat gataagctcc 5520 
caaataatga agatagagtc agaggaagtt ttgcttttga ttcacctcat cattacacgc 5580 
ctattgaagg aactccttac tgtttttcac gaaatgattc tttgagttct ctagattttg 5640 
atgatgatga tgttgacctt tccagggaaa aggctgaatt aagaaaggca aaagaaaata 5700 
aggaatcaga ggctaaagtt accagccaca cagaactaac ctccaaccaa caatcagcta 5760 
ataagacaca agctattgca aagcagccaa taaatcgagg tcagcctaaa cccatacttc 5820 
agaaacaatc cacttttccc cagtcatcca aagacatacc agacagaggg gcagcaactg 5880 
atgaaaagtt acagaatttt gctattgaaa atactccagt ttgcttttct cataattcct 5940 
ctctgagttc tctcagtgac attgaccaag aaaacaacaa taaagaaaat gaacctatca 6000 
aagagactga gccccctgac tcacagggag aaccaagtaa acctcaagca tcaggctatg 6060 
ctcctaaatc atttcatgtt gaagataccc cagtttgttt ctcaagaaac agttctctca 6120 
gttctcttag tattgactct gaagatgacc tgttgcagga atgtataagc tccgcaatgc 6180 
caaaaaagaa aaagccttca agactcaagg gtgataatga aaaacatagt cccagaaata 6240
tgggtggcat attaggtgaa gatctgacac ttgatttgaa agatatacag agaccagatt 6300 
cagaacatgg tctatcccct gattcagaaa attttgattg gaaagctatt caggaaggtg 6360 
caaattccat agtaagtagt ttacatcaag ctgctgctgc tgcatgttta tctagacaag 6420 
cttcgtctga ttcagattcc atcctttccc tgaaatcagg aatctctctg ggatcaccat 6480 
ttcatcttac acctgatcaa gaagaaaaac cctttacaag taataaaggc ccacgaattc 6540 
taaaaccagg ggagaaaagt acattggaaa ctaaaaagat agaatctgaa agtaaaggaa 6600 
tcaaaggagg aaaaaaagtt tataaaagtt tgattactgg aaaagttcga tctaattcag 6660 
aaatttcagg ccaaatgaaa cagccccttc aagcaaacat gccttcaatc tctcgaggca 6720 
ggacaatgat tcatattcca ggagttcgaa atagctcctc aagtacaagt cctgtttcta 6780 
aaaaaggccc accccttaag actccagcct ccaaaagccc tagtgaaggt caaacagcca 6840 
ccacttctcc tagaggagcc aagccatctg tgaaatcaga attaagccct gttgccaggc 6900 
agacatccca aataggtggg tcaagtaaag caccttctag atcaggatct agagattcga 6960 
ccccttcaag acctgcccag caaccattaa gtagacctat acagtctcct ggccgaaact 7020 
caatttcccc tggtagaaat ggaataagtc ctcctaacaa attatctcaa cttccaagga 7080 
catcatcccc tagtactgct tcaactaagt cctcaggttc tggaaaaatg tcatatacat 7140 
ctccaggtag acagatgagc caacagaacc ttaccaaaca aacaggttta tccaagaatg 7200 
ccagtagtat tccaagaagt gagtctgcct ccaaaggact aaatcagatg aataatggta 7260 
atggagccaa taaaaaggta gaactttcta gaatgtcttc aactaaatca agtggaagtg 7320 
aatctgatag atcagaaaga cctgtattag tacgccagtc aactttcatc aaagaagctc 7380 
caagcccaac cttaagaaga aaattggagg aatctgcttc atttgaatct ctttctccat 7440 
catctagacc agcttctccc actaggtccc aggcacaaac tccagtttta agtccttccc 7500 
ttcctgatat gtctctatcc acacattcgt ctgttcaggc tggtggatgg cgaaaactcc 7560 
cacctaatct cagtcccact atagagtata atgatggaag accagcaaag cgccatgata 7620 
ttgcacggtc tcattctgaa agtccttcta gacttccaat caataggtca ggaacctgga 7680 
aacgtgagca cagcaaacat tcatcatccc ttcctcgagt aagcacttgg agaagaactg 7740 
gaagttcatc ttcaattctt tctgcttcat cagaatccag tgaaaaagca aaaagtgagg 7800 
atgaaaaaca tgtgaactct atttcaggaa ccaaacaaag taaagaaaac caagtatccg 7860 
caaaaggaac atggagaaaa ataaaagaaa atgaattttc tcccacaaat agtacttctc 7920 
agaccgtttc ctcaggtgct acaaatggtg ctgaatcaaa gactctaatt tatcaaatgg 7980 
cacctgctgt ttctaaaaca gaggatgttt gggtgagaat tgaggactgt cccattaaca 8040 
atcctagatc tggaagatct cccacaggta atactccccc ggtgattgac agtgtttcag 8100 
aaaaggcaaa tccaaacatt aaagattcaa aagataatca ggcaaaacaa aatgtgggta 8160 
atggcagtgt tcccatgcgt accgtgggtt tggaaaatcg cctgaactcc tttattcagg 8220 
tggatgcccc tgaccaaaaa ggaactgaga taaaaccagg acaaaataat cctgtccctg 8280
tatcagagac taatgaaagt tctatagtgg aacgtacccc attcagttct agcagctcaa 8340 
gcaaacacag ttcacctagt gggactgttg ctgccagagt gactcctttt aattacaacc 8400 
caagccctag gaaaagcagc gcagatagca cttcagctcg gccatctcag atcccaactc 8460 
cagtgaataa caacacaaag aagcgagatt ccaaaactga cagcacagaa tccagtggaa 8520 
cccaaagtcc taagcgccat tctgggtctt accttgtgac atctgtttaa aagagaggaa 8580 
gaatgaaact aagaaaattc tatgttaatt acaactgcta tatagacatt ttgtttcaaa 8640 
tgaaacttta aaagactgaa aaattttgta aataggtttg attcttgtta gagggttttt 8700 
gttctggaag

tcttgatggt 

tatttaaagt 

cact^cagat 

ctaaacttac 

agtccatcta 

atgaacaaigc 

tcagaagtaa 

aataatttac 

gaatgaaact 

ctgtaatttg 

catgaattgt 

aaactgttac 

tgtttaaaat 

agagtaaaat 

ttccatgcgt 

ttttaacatt 

attcatgtaa 

acataatttc 

ttgcagtctc 

actgggtctg 

aagtataatc 

gatttaatac 

tttatttgtg 

gaagtcttcc 

tatatcttcc 

tttgttgaat 

agagaaaacc 

ggtcat 



ccatatttga tagtatactt tgtcttcact ggtcttattt tgggaggcac 
taqgaaaaaa atagtaaagc caagtatgtt tgtacagtat gttttacatg 
aqcatcccat cccaacttcc tttaattatt gcttgtctta aaataatgaa 
agaaaatatg atatattgct gttatcaatc atttctagat tataaactga 
atcagggaaa aattggtatt tatgcaaaaa aaaatgtttt tgtccttgtg 
acatcataat taatcatgtg gctgtgaaat tcacagtaat atggttcccg 
tttacccagc ctgtttgctt tactgcatga atgaaactga tggttcaatt 
tgattaacag ttatgtggtc acatgatgtg catagagata gctacagtgt 
actattttgt gctccaaaca aaacaaaaat ctgtgtaact gtaaaacatt 
attttacctg aactagattt tatctgaaag taggtagaat ttttgctatg 
ttgtatattc tggtatttga ggtgagatgg ctgctctttt attaatgaga 
qtctcaacag aaactaaatg aacatttcag aataaattat tgctgtatgt 
tgaaattggt atttgtttga agggtcttgt ttcacatttg tattaataat 
gcctctttta aaagcttata taaatttttt ncttcagctt ctatgcatta 
tcctcttact gtaataaaaa caattgaaga agactgttgc cacttaacca 
tqgcacttat ctattcctga aattctttta tgtgattagc tcatcttgat 
tttccactta aacttttttt tcttactcca ctggagctca gtaaaagtaa 
tagcaatgca agcagcctag cacagactaa gcattgagca taataggccc 
ctctttctta atattataga aattctgtac ttgaaattga ttcttagaca 
ttcgaggctt tacagtgtaa actgtcttgc cccttcatct tcttgttgca 
acatgaacac tttttatcac cctgtatgtt agggcaagat ctcagcagtg 
aqcactttgc catgctcaga aaattcaaat cacatggaac tttagaggta 
gattaagata ttcagaagta tattttagaa tccctgcctg ttaaggaaac 
qtaggtacag ttctggggta catgttaagt gtccccttat acagtggagg 
ttcctgaagg aaaataaact gacacttatt aactaagata atttacttaa 
ctgatttgtt ttaaaagatc agagggtgac tgatgataca tgcatacata 
aaatgaaaat ttatttttag tgataagatt catacactct gtatttgggg 
tttttaagca tggtggggca ctcagatagg agtgaataca cctacctggt 



8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10386 



<210> 343 

<211> 2191 

<212> DNA 

<213> Homo sapiens 



<400> 343 

ggtggccgag 

tcgggccgtt 

agagcccgtc 

ctgacaaccg 

gccccaccgt 

tgcggcattt 

cccaggacct 

tgtgggtgaa 

acatcagcaa 

tcaacaacac 

cctccaagta 

ctggactgca 

ccgtcaccac 

gcagtaacct 

atgggactgg 

tgatgcacga 

cactggggcc 

ccaagagcca 

aggatgcgga 

tgcttcgctg 

tgatgcagat 

tggcgctgcc 

aagtggtgcg 

ctgcgtccca 

ccttggaccc 

gggctccgta 

ggcacctgga 

atcagaagcc 

atccacctct 

gtctggatca 



cgggggaccg 

gttgtggggc 

gcgggagtgc 

acgccggggc 

ggacatgcca 

tgtcggctgg 

gcgcacaaga 

tggggtcgac 

cctggtccag 

actcaccccc 

tcccaagggt 

gcggtctgta 

cagcgtggag 

gttcaagttg 

gacccagggc 

acgccctgcc 

tgtgtctgac 

gttcctcatc 

catccgaggg 

gcttggtgcc 

gtgtgaccgc 

gcagttcttc 

tagggacaag 

cctagaatct 

ctcccggcct 

tgtggatgtg 

gttgattcag 

cattattcag 

gatgttcact 

aaaacgcaga 



ggaagcatgg cccgggggtc ggcggttgcc 
tgcgcgctgg ggctgcaggg cgggatgctg 
aaggagctgg acggcctctg gagcttccgc 
ttcgaggagc agtggtaccg gcggccgctg 
gttccctcca gcttcaatga catcagccag 
gtgtggtacg aacgggaggt gatcctgccg 
gtggtgctga ggattggcag tgcccattcc 
acgctagagc atgagggggg ctacctcccc 
gtggggcccc tgccctcccg gctccgaatc 
accaccctgc caccagggac catccaatac 
tactttgtcc agaacacata ttttgacttt 
cttctgtaca cgacacccac cacctacatc 
caagacagtg ggctggtgaa ttaccagatc 
gaagtgcgtc ttttggatgc agaaaacaaa 
caacttaagg tgccaggtgt cagcctctgg 
tatctgtatt cattggaggt gcagctgact 
ttctacacac tccctgtggg gatccgcact 
aatgggaaac ctttctattt ccacggtgtc 
aagggcttcg actggccgct gctggtgaag 
aacgctttcc gtaccagcca ctacccctat 
tatgggattg tggtcatcga tgagtgtccc 
aacaacgttt ctctgcatca ccacatgcag 
aaccaccccg cggtcgtgat gtggtctgtg 
gctggctact acttgaagat ggtgatcgct 
gtgacctttg tgagcaactc taactatgca 
atctgtttga acagctacta ctcttggtat 
ctgcagctgg ccacccagtt tgagaactgg 
agcgagtatg gagcagaaac gattgcaggg 
gaagagtacc agaaaagtct gctagagcag 
aaatatgtgg ttggagagct catttggaat 
tgggcggcgc 60
tacccccagg 120 
gccgacttct 180 
tgggagtcag 240 
gactggcgtc 300 
gagcgatgga 360 
tatgccatcg 420 
ttcgaggccg 480 
actatcgcca 540 
ctgactgaca 600 
ttcaactacg 660 
gatgacatca 720 
tctgtcaagg 780 
gtcgtggcga 840 
tggccgtacc 900 
gcacagacgt 960 
gtggctgtca 1020 
aacaagcatg 1080 
gacttcaacc 1140 
gcagaggaag 1200 
ggcgtgggcc 1260 
gtgatggaag 1320 
gccaacgagc 1380 
cacaccaaat 1440 
gcagacaagg 1500 
cacgactacg 1560 
tataagaagt 1620 
tttcaccagg 1680 
taccatctgg 1740 
tttgccgatt 1800 
tcatgactga
agagacaacc 
aaaccaggta 
gagcaagact 
agcagaacaa 
tgtggtcatc 
aaataaagag 



acagtcaccg 
aaaaagtgca 
tccccaictca 
gataccacct 
gtgcctcctg 
tattctagca 
ttggcatgaa 



acgagagtgc tggggaataa aaaggggatc ttcactcggc 
gcgttccttt tgcgagagag atactggaag attgccaatg 
gtagccaagt cacaatgttt ggaaaacagc ccgtttactt 
gcgtgtccct tcctccccga gtcagggcga cttccacagc 
gactgttcac ggcagaccag aacgtttctg gcctgggttt 
gggaacacta aaggtggaaa taaaagattt tctattatgg 
agtcgctact g 



1860 
1920 
1980 
2040 
2100 
2160 
2191 



<210> 344 
<211> 2776 
<212> DNA 

<213> Homo sapiens 



<400> 344 
cagggcagac 
acaccgaggg 
ctcgggtgcg 
atcgagtcgg 
cctgactcca 
gacatcatgg 
cagctgtccc 
gtgaccttcc 
tactccatgc 
ctcaacgaga 
ctgccgttcg 
gagtgccagc 
tttcagaccg 
ctggacgcca 
cggctgctgg 
gccatcctga 
aacgaattcg 
cagcccatct 
atccceaagt 
aagaatgctt 
accctgaaag 
agaggtgact 
gccacagagt 
gtgaccgtgc 
agcctctgcc 
attgggaaaa 
tgccggaagg 
tgcctgtgcc 
gacaccatca 
tgcttctgcg 
aggaccactg 
cgctgcaacg 
ggctgcccct 
ggcccctttg 
gtgaagggca 
gagcagcagg 
gcaggcccca 
attctcctgc 
cgctttgaga 
gccaccacga 
aggccgtcag 
ctcttgagga 
ccatggccgg 
agtctttgca 
gtgcaagtca 
tcagggtata 
aaaataaaac 



tggtagcaaa 
acatgctggg 
tcctctctca 
ggcccggctg 
ttcgctgcga 
accccacaag 
cacaaaaagt 
ggcgggccaa 
ttgatgacct 
tcaccgagtc 
tgaacacgca 
ccccgtttgc 
aggtcgggaa 
tgatgcaggt 
tgtttgccae 
cccccaacga 
actacccatc 
tcgcggtgac 
cagccgtggg 
acaataaact 
tcacctacga 
gtgatggcgt 
gcatccagga 
aggttcttcc 
atggcaaggg 
actgtgagtg 
acaacaactc 
acaccagcga 
actgtgagcg 
ggaagtgccg 
agggctgcct 
tatgcgagtg 
caccctgtgg 
ggaagaactg 
ggacctgcaa 
acgggatgga 
acatcgccgc 
tggtcatctg 
aggagaagct 
cggtcatgaa 
gacccaccat 
tgtcaccaat 
ccggtgcttc 
tggagacttg 
ggacatcagt 
aaatgacatc 
ttcaat 



gcccccacgc 
cctgcgcccc 
ggagtgcacg 
cacctggtgc 
cacccggcca 
cctcgctgaa 
gacgctttac 
gggctacccc 
caggaatgtc 
cggccgcatt 
ccctgataag 
cttcaggcac 
gcagctgatt 
cgccgcctgc 
tgatgacggc 
cggccgctgt 
ggtgggccag 
cagtaggatg 
ggagctgtct 
ctcctccagg 
ctccttctgc 
gcagatcaat 
gcagtcgttt 
ccagtgtgag 
cttcttggag 
ccagacacag 
catcatctgc 
cgtccccggc 
ctacaacggc 
ctgccacccg 
gaacccgcgg 
ccattcaggc 
caagtacatc 
cagcgcggcg 
ggagagggac 
ccgctacctc 
catcgtcggg 
gaaggctctg 
caagtcccag 
ccccaagttt 
gtctgcccca 
taaccagaaa 
tgggggctcg 
aggagggctt 
ctgattaaag 
ccattaatta 



ccagccagga 
ccactgctcg 
aagttcaagg 
cagaagctga 
cagctgctca
acccaggaag 
ctgcgaccag 
atcgacctgt 
aagaagctag 
ggcttcgggt 
ctgcgaaacc 
gtgctgaagc 
tccggaaacc 
ccggaggaaa 
ttccatttcg 
cacctggagg 
ctggcgcaca 
gtgaagacct 
gaggactcca 
gtcttcctgg 
agcaatggag 
gtcccgatca 
gtcatccggg 
tgccggtgcc 
tgcggcatct 
ggccggagca 
tcagggctgg 
aagctgatat 
caggtctgcg 
ggctttgagg 
cgtgttgagt 
taccagctgc 
tcctgcgccg 
tgtccgggcc 
tcagagggct 
atctatgtgg 
ggcaccgtgg 
atccacctga 
tggaacaatg 
gctgagagtt 
tcacgcggcc 
tccagttatt 
tcggggggac 
gaggttggtg 
gtggtgccaa 
tattgttaat 



gcaccgccgc 
ccctggtggg 
tcagcagctg 
acttcacagg 
tgaggggctg 
accacaatgg 
gccaggcage 
actatctgat 
gtggcgacct 
ccttcgtgga 
catgccccaa 
tgaccaacaa 
tggatgcacc 
tcggctggcg 
cgggcgacgg 
acaacttgta 
agctggctga 
acgagaaact 
gcaatgtggt 
atcacaacgc 
tgacgcacag 
ccttccaggt 
cgctgggctt 
gggaccagag 
gcaggtgtga 
gccaggagct 
gggactgtgt 
acgggcagta 
gcggcccggg 
gctcagcgtg 
gtagtggtcg 
ctctgtgcca 
agtgcctgaa 
tgcagctgtc 
gctgggtggc 
atgagagccg 
caggcatcgt 
gcgacctccg 
ataatcccct 
aggagcactt 
gagacatggc 
ttccgccctc 
agctccactc 
aggttaggtg 
tttatttaca 
caatcacgtg 



ggactccagc 
gctgctctcc 
ccgggaatgc 
gccgggggat 
tgcggctgac 
gggccagaag 
agcgttcaac 
ggacctctcc 
gctccgggcc 
caagaccgtg 
caaggagaaa 
ctccaaccag 
cgagggtggg 
caacgtcacg 
aaagctgggc 
caagaggagc 
aaacaacatc 
caccgagatc 
ccatctcatt 
cctccccgac 
gaaccagccc 
gaaggtcacg 
cacggacata 
cagagaccgc 
cactggctac 
ggaaggaagc 
ctgcgggcag 
ctgcgagtgt 
gagggggctc 
ccagtgcgag 
tggccggtgc 
ggagtgcccc 
gttcgaaaag 
gaacaacccc 
ctacacgctg 
agagtgtgtg 
gctgatcggc 
ggagtacagg 
tttcaagagc 
ggtgaagaca 
ttggccacag 
aaaatgacag 
tgactggcac 
cgtgtttcct 
tttaaacttg 
tatagaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2776 



<210> 345 
<211> 3160 
<212> DNA 

<213> Homo sapiens 
<400> 345 
cctcccctcg 
ttccgaggcg 
gatgtggcag 
gcgctcagtt 
tgtgaggcga 
gcccctctca 
gcggcggcgg 
cttcctcggc 
aggcgcggcg 
cggcggcggc 
ccagggctgg 
ggcggcggcg 
caccccccgt 
cgtcttctcc 
cccagtcgct 
gccaagcggc 
tgcagaagaa 
acaggctccc 
atcaagagga 
gatttcctgc 
ttttggattc 
atgacaccgc 
cacagctaga 
acaatcatgt 
gtgcatattt 
gggaagtaag 
attattatag 
acaagatgat 
tggtctgcca 
acaagttcat 
agttcttcca 
atacattctt 
gtgatcaaga 
tagtacttac 
acttttctcc 
atccagaggc 
attatagata 
agcatacaca 
aaataaactt 
gtcagattac 
catccacagg 
tatatacctt 
ctttcccgtt 
attttttcct 
gttcacatcc 
gtttccgaaa 
aatgctcaga 
ttacacacac 
gctgtcactg 
gctaaaggaa 
ccacaaatga 
accatataaa 
atgatttatt 



cccggcgcgg 
cccgggctcc 
gactctttat 
ctctcctctc 
ggccgggctc 
gcgcctgtga 
cagcggcggc 
ttctcctgaa 
gcggcggcgg 
cgcggcggct 
gaacgccgga 
gcacatccag 
ggcccgggct 
ccattccgct 
gcaaccatcc 
ggcagagcga 
gccccgccac 
agacatgaca 
tggattcgac 
agaaagactt 
aaagcataaa 
caaatttaat 
acttatcaaa 
tgcagcaatt 
attacatcgg 
gaccagagac 
ctacctgtta 
gtttgaaact 
gctaaaggtg 
gtactttgag 
caaacagaac 
cataccagga 
aatcgatagc 
tttaacaaaa 
aaattttaag 
tagcagttca 
ttctgacacc 
aattacaaaa 
gaataaactg 
cagttatagg 
gttttgacac 
tttgtgtcaa 
ttattccagt 
tttgtgttct 
tacccctttg 
ggttttgcta 
aaggaaataa 
ctttctttag 
cttgttgttt 
gtgaatctgt 
agggatataa 
tgtggaggct 
aaatatgttt 



tcccgtccgc ctctcgctcg cctcccgcct 
cggcgcggcg gcggaggggg cgggcaggcc 
gcgctgcggc aggatacgcg ctcggcgctg 
ggaagctgca gccatgatgg aagtttgaga 
aggcgaggga gatgagagac ggcggcggcc 
gcagccgcgg gggcagcgcc ctcggggagc 
gtttctcgcc tcctcttcgt cttttctaac 
agggaaggtg gaagccgtgg gctcgggcgg 
cggcacctcc cgctcctgga gcggggggga 
gcagctccag ggagggggtc tgagtcgcct 
gagttggtct ctccccttct actgcctcca 
ggacccgggc cggttttaaa cctcccgtcc 
ccggaggccg ccggcggagg cagccgttcg 
gccgccgctg ccaggcctct ggctgctgag 
agcagccgcc gcagcagcca ttacccggct 
qgggcatcag ctaccgccaa gtccagagcc 
cagcagcrtc tgccatctct ctcctccttt 
gccatcatca aagagatcgt tagcagaaac 
ttagacttga cctatattta tccaaacatt 
gaaggcgtat acaggaacaa tattgatgat 
aaccattaca agatatacaa tctttgtgct 
tgcagagttg cacaatatcc ttttgaagac 
cccttttgtg aagatcttga ccaatggcta 
cactgtaaag ctggaaaggg acgaactggt 
ggcaaatttt taaaggcaca agaggcccta 
aaaaagggag taactattcc cagtcagagg 
aagaatcatc tggattatag accagtggca 
attccaatgt tcagtggcgg aacttgcaat 
aagatatatt cctccaattc aggacccaca 
ttccctcagc cgttacctgt gtgtggtgat 
aagatgctaa aaaaggacaa aatgtttcac 
ccagaggaaa cctcagaaaa agtagaaaat 
atttgcagta tagagcgtgc agataatgac 
aatgatcttg acaaagcaaa taaagacaaa 
gtgaagctgt acttcacaaa aacagtagag 
acttctgtaa caccagatgt tagtgacaat 
actgactctg atccagagaa tgaacctttt 
gtctgaattt ttttttatca agagggataa 
aaaatggacc tttttttttt taatggcaat 
aacaattctc ttttcctgac caatcttgtt 
ttgttgtcca gttgaaaaaa ggttgtgtag 
aaggacattt aaaattcaat taggattaat 
tttataaaaa gtggagacag actgatgtgt 
gtcaccaact gaagtggcta aagagctttg 
cacttgtggc aacagataag tttgcagttg 
ccattctaat gcatgtattc gggttagggc 
ttttatgctg gactctggac catataccat 
catgctacag ttattaatct ggacattcga 
gcgcattttt ttttaaagca tattggtgct 
attggggtac aggaatgaac cttctgcaac 
aaataatgtc ataggtaaga aacacagcaa 
atcaacaaag aatgggcttg aaacattata 
tctcaattgt aaaaaaaaaa 



cccctcggtc 
ggcgggcggt 
ggacgcgact 
gttgagccgc 
gcggcccgga 
cggccggcct 
cgtgcagcct 
gagccggctg 
gaagcggcgg 
gtcaccattt 
acacggcggc 
gccgccgccg 
gaggattatt 
gagaagcagg 
gcggtccaga 
atttccatcc 
ttcttcagcc 
aaaaggagat 
attgctatgg 
gtagtaaggt 
gaaagacatt 
cataacccac 
agtgaagatg 
gtaatgatat 
gatttctatg 
cgctatgtgt 
ctgttgtttc 
cctcagtttg 
cgacgggaag 
atcaaagtag 
ttttgggtaa 
ggaagtctat 
aaggaatatc 
gccaaccgat 
gagccgtcaa 
gaacctgatc 
gatgaagatc 
aacaccatga 
aggacattgt 
ttaccctata 
ctgtgtcatg 
aaagatggca 
atacgtagga 
tgatatactg 
gctaagagag 
aatggagggg 
ctccagctat 
ggaattggcc 
agaaaaggca 
atcttaagat 
caatgactta 
aaaattgaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3160 



<210> 346 

<211> 2629 

<212> DNA 

<213> Homo sapiens 



<400> 346 

acttgtcatg 

ggttttcccc 

agagccaccg 

gggagcgtgc 

gggtcactgc 

aaacattttc 

cccaagcaat 

acccaggtcc 



qcgactgtcc agctttgtgc caggagcctc gcaggggttg 
tcccatgtgc tcaagactgg cgctaaaagt tttgagcttc 
tccaqqgagc aggtagctgc tgggctccgg ggacactttg 
tttccacgac ggtgacacgc ttccctggat tggcagccag 
catggaggag ccgcagtcag atcctagcgt cgagccccct 
agacctatgg aaactacttc ctgaaaacaa cgttctgtcc 
qqatgatttg atgctgnccc cggacgatat tgaacaatgg 
aqatgaagct cccagaatgc cagaggctgc tccccgcgtg 



atgggattgg 60 
tcaaaagtct 120 
cgttcgggct 180 
actgccttcc 240 
ctgagtcagg 300 
cccttgccgt 360 
ttcactgaag 420 
gcccctgcac 480 



cagcagctcc tacaccggcg gcccctgcac cagccccctc ctggcccctg tcatcttctg 540 
tcccttccca gaaaacctac cagggcagct acggtttccg tctgggcttc ttgcattctg 600 
ggacagccaa gtctgtgact tgcacgtact cccetgccct caacaagatg ttttgccaac 660 
tggccaagac ctgccctgtg cagctgtggg ttgattccac acccccgecc ggcacccgcg 720 
tccgcgccat ggccatctac aagcagtcac agcacatgac ggaggttgtg aggcgctgcc 780 
cccaccatga gcgctgctca gatagcgatg gtctggcccc tcctcagcat cttatccgag 840 
tggaaggaaa tttgcgtgtg gagtatttgg atgacagaaa cacttttcga catagtgtgg 900 
tggtgcccta tgagccgcct gaggttggct ctgactgtac caccatccac tacaactaca 960 
tgtgtaacag ttcctgcatg ggcggcatga accggaggcc catcctcacc atcatcacac 1020 
tggaagactc cagtggtaat ctactgggac ggaacagctt tgaggtgcgt gtttgtgcct 1080 
gtcctgggag agaccggcgc acagaggaag agaatctccg caagaaaggg gagcctcacc 1140 
acgagctgcc cccagggagc actaagcgag cactgcccaa caacaccagc tcctctcccc 1200
agccaaagaa gaaaccactg gatggagaat atttcaccct tcagatccgt gggcgtgagc 1260 
gcttcgagat gttccgagag ctgaatgagg ecttggaact caaggatgcc caggctggga 1320 
aggagccagg ggggagcagg gctcactcca gecacctgaa gtccaaaaag ggtcagtcta 1380 
cctcccgcca taaaaaactc atgttcaaga cagaagggcc tgactcagac tgacattctc 1440 
cacttcttgt tccccactga cagcctccca cccccatctc tccctcccct gccattttgg 1500 
gttttgggtc tttgaaccct tgcttgcaat aggtgtgcgt cagaagcacc caggacttcc 1560 
atttgctttg teccggggct ccactgaaca agttggcctg cactggtgtt ttgttgtggg 1620 
gaggaggatg gggagtagga cataccagct tagattttaa ggtttttact gtgagggatg 1680 
tttgggagat gtaagaaatg ttcttgcagt taagggttag tttacaatca gccacattct 1740 
aggtaggtag gggcccactt caccgtacta accagggaag ctgtccctca tgttgaattt 1800 
tctctaactt caaggcccat atctgtgaaa tgctggcatt tgcacctacc tcacagagtg 1860 
cattgtgagg gttaatgaaa taatgtacat ctggccttga aaccaccttt tattacatgg 1920 
ggtctaaaac ttgaccccct tgagggtgcc tgttcectct ccctctccct gttggctggt 1980 
gggttggtag tttctacagt tgggcagctg gttaggtaga gggagttgtc aagtcttgct 2040 
ggcccagcca aaccctgtct gacaacctct tggtcgacct tagtacctaa aaggaaatct 2100 
caccccatcc cacaccctgg aggatttcat ctcttgtata tgatgatctg gatccaccaa 2160 
gacttgtttt atgctcaggg tcaatttctt ttttcttttt tttttttttt tttctttttc 2220 
tttgagactg ggtctcgctt tgttgcccag gctggagtgg agtggcgtga tcttggetta 2280 
ctgcagcctt tgcctccccg gctcgagcag tcctgcctca gcctccggag tagctgggac 2340 
cacaggttca tgccaccatg gccagccaac ttttgcatgt tttgtagaga tggggtctca 2400 
cagtgttgcc caggctggtc tcaaactcct gggctcaggc gatccacctg tctcagcctc 2460 
ccagagtgct gggattacaa ttgtgagcca ccacgtggag ctggaagggt caacatcttt 2520 
tacattctgc aagcacatct gcattttcac cccacccttc ccctccttct ccctttttat 2580 
atcccatttt tatatcgatc tcttatttta caataaaact ttgctgcca 2629 

<210> 347 
<211> 3442 

<212> DNA
<213> Homo sapiens 

<400> 347 

agccggtgcg ccgcagacta gggcgcctcg ggccagggag cgcggaggag ccatggccac 60 
cgctaacggg gccgtggaaa acgggcagcc ggacgggaag ccgccggccc tgccgcgccc 120 
catccgcaac ctggaggtca agttcaccaa gatatttatc aacaatgaat ggcacgaatc 180 
caagagtggg aaaaagtttg ctacatgtaa cccttcaact cgggagcaaa tatgtgaagt 240 
ggaagaagga gataagcccg acgtggacaa ggctgtggag gctgcacagg ttgccttcca 300 
gaggggctcg ccatggcgcc ggctggatgc cctgagtcgt gggcggctgc tgcaccagct 360 
ggctgacctg gtggagaggg accgcgccac cttggccgcc ctggagacga tggatacagg 420 
gaagccattt cttcatgctt ttttcatcga cctggagggc tgtattagaa ccctcagata 480 
ctttgcaggg tgggcagaca aaatccaggg caagaccatc cccacagatg acaacgtcgt 540 
atgcttcacc aggcatgagc ccattggtgt ctgtggggcc atcactccat ggaacttccc 600 
cctgctgatg ctggtgtgga agctggcacc cgccctctgc tgtgggaaca ccatggtcct 660 
gaagcctgcg gagcagacac ctctcaccgc cctttatctc ggctctctga tcaaagaggc 720 
cgggttccct ccaggagtgg tgaacattgt gccaggattc gggcccacag tgggagcagc 780 
aatttcttct caccctcaga tcaacaagat cgccttcacc ggctccacag aggttggaaa 840 
actggttaaa gaagctgcgt cccggagcaa tctgaagcgg gtgacgctgg agctgggggg 900 
gaagaacccc tgcatcgtgt gtgcggacgc tgacttggac ttggcagtgg agtgtgccca 960 
tcagggagtg ttcttcaacc aaggccagtg ttgcacggca gcctccaggg tgttcgtgga 1020 
ggagcaggtc tactctgagt ttgtcaggcg gagcgtggag tatgccaaga aacggcccgt 1080 
gggagacccc ttcgatgtca aaacagaaca ggggcctcag attgatcaaa agcagttcga 1140 
caaaatctta gagctgatcg agagtgggaa gaaggaaggg gccaagctgg aatgcggggg 1200 
ctcagccatg gaagacaagg ggctcttcat caaacccact gtcttctcag aagtcacaga 1260 
caacatgcgg attgccaaag aggagatttt cgggccagtg caaccaatac tgaagttcaa 1320 
aagtatcgaa gaagtgataa aaagagcgaa tagcaccgac tatggactca cagcagccgt 1380 
gttcacaaaa aatctcgaca aagccctgaa gttggcttct gccttagagt ctggaacggt 1440 
ctggatcaac tgctacaacg ccctctatgc acaggctcca tttggtggct ttaaaatgtc 1500 
aggaaatggc 
caccatcaaa 
cggacggcgg 
cccgggacac 
tctcactctc 
gtcctgcctg 
agacagtgag 
gcttgaaatg 
ctgcaagcag 
cacgcctctt 
ttttggtttt 
ttggttagtg 
ccgtgagatt
agagcttagt 
aatattgttg 
gcaaatccta 
aaattacgca 
ttctccagtt 
taataacaca 
aaataacacc 
ctggcaggtt 
atgttttaac 
gttatccgac 
cttaagtttg 
agtgtgtgta 
attaccacat 
grtaaatcag 
gacactaacc 
ctttaatatg 
gtaagtacac 
tatatatagt 
ctataggcct 
aaataaaatt 



agagaactag 
cttggcgaca 
aatgtggcag 
attcttctgg 
ctgtttattc 
gggagggagc 
atactcaggg 
ctttgccgaa 
ggctcttgca 
tccgaggtga 
tgttttrttgt 
cataccgtgg 
cggcttcaaa 
caaagacgat 
aggttacctt 
ggataattca 
tttctatxac 
ctgttaccca 
gccttgttat 
tggattctat 
gtgaattttt 
ttgggaaact 
ggatactttt 
caaattaagt 
taacgaaaac 
ttggcttccc 
gatttggctt 
ttgatagtga 
aaatgctaca 
agaggtggta 
gttaggtttc 
gggaattccg 
ggatatttga 



qtgaatacgc tttggccgaa tacacagaag 
agaacccctg aaggaaaggc ggggctcctt 
atgaaatgtg ctggaggaaa aaaatgacat 
aqqctttaca tctactggag ttgaatgatt 
accagactgg ggatgcctat aggttgtctg 
tgttggccat ttctgtgttt ccctttaaac 
cgttgttaac agggagtggt atttgaagtg 
tctgactcca gtaagaatgt gggaaaaccc 
ccagcggtct cctcagggtg gacctgctta 
aggtgggacc attccttggg aaaggattca 
tttcttgttt ttaaaaaaag gatttcacag 
aagggcgcca gggtctttgt ggattgcatg 
ccaatactgc ctttiggaata tgacagaatc 
atcacggtct accttaacca aggcactttc 
tgctgctaaa gatccaatct tctaacgeca 
cctcctcatt tgacaaatca gagctgtaat 
qttcactaac agcttatgat aagtctgtgt 
atttagatta gtaaagcgta cacaactgga 
ttttaagtcc tattttgata ttaatttctg 
qqaggacctc ggtcttcatc caagtggcct 
cttttcctct ttgggaatcc aaatgatgat 
qaaagtgttc ccatatagct tcaaaaacaa 
atggttacta actagtactt tcctaattgg 
tqgggagggc aataataaaa tgagggcccg 
catgtataaa atgggcctat cacccttgtc 
ttcatcagct aacacttatc acttatacta 
catacactga atttteagta ttttatctca 
tacgttagag ggttcctatt cttccattgt 
ttatttataa ttggtagagt tattgtatct 
tatttaaact tctgtaatat actgtattta 
acttctttta aggtttaccc ctgtggtgtg 
atcctagctg cagatcgcat cccacaatgc 
ga 



tgaaaactgt 
cctcaaacat 
ttctgacctt 
gctgttttcc 
tgaaatcgca 
cagatcctgg 
tccagcagtt 
cctgtgtgtt 
cagagcaagc 
cagtaaggtt 
tgagaaagtt 
ttgacattga 
aatagcccag 
ttaagcagaa 
caacagcata 
tcactttaac 
agtcttcctt 
aagactgctg 
attagttagt 
gagtatttca 
gtgcaatttc 
aaacaaatgt 
gaaagtagtg 
taacagaacc 
agagatataa 
ccaataactt 
agtagatata 
acgataatgt 
ttttatagtt 
gaaatggaaa 
gtttaaaaat 
gagaargata 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3442 



<210> 348 
<211> 737 
<212> DNA 
<213> Homo sapiens 



<400> 348 

ggagtttcgc 

ttcgaggccg 

aggaggtggt 

ggcagctccc 

gtcacctggg 

acatggtgaa 

actatgaggc 

agaccctgct 

tcgctgacta 

tggatgcgtt 

aggccttcct 

gagggttggg 

aaatttctaa 



cgccgcagtc 

ctgcgcggcc 

gaccgnggag 

caagttccag 

ccgcaccctt 

tgacggcgtg 

gggcaaggat 

gtcccagaac 

caacctgctg 

ccccctgctc 

ggcctcccct 

gggactctga 

gagagct 



ttcgccacca 
ctgcgcatgc 
acgtggcagg 
gacggagacc 
gggctctatg 
gaggacctcc 
gactatgtga 
cagggaggca 
gacttgctgc 
tcagcatatg 
gagtacgtga 
gcgggaggca 



tgccgcccta 
tgctggcaga 
agggctcact 
tcaccctgta 
ggaaggacca 
gctgcaaata 
aggcactgcc 
agaccttcat 
tgatccatga 
tggggcgcct 
acctccccat 
gagtttgcct 



caccgtggtc 
tcagggccag 
caaagcctcc 
ccagtccaat 
gcaggaggca 
catctccctc 
cgggcaactg 
tgtgggagac 
ggtcctagcc 
cagcgcccgg 
caatggcaac 
tcctttctcc 



tatttcccag 60 
agctggaagg 120 
tgcctatacg 180 
accatcctgc 240 
gccctggtgg 300 
atctacacca 360 
aagccttttg 420 
cagatctcct 480 
cctggctgcc 540
cccaagctca 600 
gggaaacagt 660 
aggaccaata 720 



<210> 349 

<211> 5189 

<212> DNA 

<213> Homo sapiens 



<400> 349 

atggccaagt 

acctgggtga 

tcaaagaagt 

cttcgtcctg 

gatgaagatg 

tttgatgaaa 

attaaagttt 

attccagtag 



cgggtggctg 
acaatgctgc 
tgtctgttga 
atacatatat 
taggaatgaa 
ttttggttaa 
ctattgatcc 
tagaacacaa 



cggcgcggga gccggcgtgg gcggcggcaa
aaaaaaagaa gagtcagaaa ctgccaacaa 
gagagtgtat cagaagaaga cacaacttga 
tgggtcagtg gagccattga cgcagttcat 
ttgcagggag gttacctttg tgccaggttt 
tgctgctgac aataaacaga gggataagaa
tgaatctaac attataagca tttggaataa 
ggtagagaaa gtttatgttc ctgctttaat



cggggcactg 60 
aaatgattct 120 
acacattctt 180 
gtgggtgtat 240 
atacaagatc 300 
catgacttgt 360 
tgggaaaggc 420 
ttttggacag 480 






cttttaacat 
tatggtgcaa 
gaatacaaac 
aaaattaaac 
aaatttaaga 
gatttggctg 
aatggatttc 
gccctgaaag 
gaaaaaggat 
cacgtggatt 
aacaaagctg 
aattgcctta 
cccaaaagtt 
tgtggcattg 
aagaagtgtt 
aatgatgctg 
gccaaatcac 
ccactcaggg 
gctgaaataa 
gcagaatctc 
gatggttctc 
ttgaagcatg 
caggaacttt 
cagaaagcct 
aaggaatatt 
gatgatgctg 
ttaacaaatt 
ttatatggta 
ctcttctcaa 
ggccagcgga 
gcccagttgg 
atgatgacta 
cagcctattg 
tatattttca 
ctccttaagt 
attcccatgg 
cccaactatg 
cctcatccca 
aaccagtatg 
gagcttccag 
aatggaacag 
actgtgaaat 
ctgcataaag 
catatgggat 
ttacgattaa 
tctacaaagc 
atagagaata 
gacccagtga 
aaccagcatg 
attttaaata 
agagatgcaa 
aaagaggatt 
gatgttctgg 
aagaaactcc 
attacagcta 
gatactgcag 
gcaggagaag 
gagaagaagg 
gcaaagaaag 
ttggaagaaa 
gaaagaccta 
gatgatgaca 
ggggaagatg 
ggcaaatcaa 
aatctcttct 
agtaatgaag 
aaagttccaa 
aagcccaaga 



ccagtaacta 
aactttgtaa 
acagttttaa 
attttgatgg 
tggaaaaact 
gttcgtgtag 
gcagttatgt 
ttattcatga 
tccagcaaat 
atgtggtaga 
gtgtatcagt 
ttgaaaatcc 
ttgggtctaa 
tagaaagtat 
catcagtaaa 
gtggtaaaca 
tggctgtgtc 
gcaaaattct 
ataatattat 
tgaaaacctt 
acataaaagg 
gttttcttga 
ccttctacag 
ggaaaataaa 
ttgctgatat 
ccattacctt 
ttatggaaga 
ctgcaacaaa 
actcagacaa 
aagttttatt 
ctggctctgt 
ttgtgaattt 
gtcagtttgg 
caatgttaag 
tcctttatga 
ttttaataaa 
atgctaggga 
tgcttccaaa 
cagtcagtgg 
ttagaacttg 
ataaaacacc 
ttgtggtgaa 
tttttaaact 
gtctgaagaa 
gttattacgg 
ttaacaatca 
ggtcaaagaa 
aagcctggaa 
atgatagttc 
tgtctctgtg 
aagggcgaga 
tagcggcatt 
ctggaatgtc 
agttggaaga 
tgaaggcaga 
cagtaaaagt 
aggcattgac 
agcctggtac 
tgaagaaacg 
cagaacctgt 
aatacacatt 
ataatgattt 
aatttgttcc 
aagccactcc 
catttccttc 
aagattctgc 
gtaaaacggt 
gagccccaaa 



tgatgatgat gagaaaaaag ttacaggtgg 
tattttcagt acaaagttta cagtagaaac 
gcagacatgg atgaataata tgatgaagac 
tgaagattac acatgcataa cattccaacc 
tgacaaggat attgtggccc tcatgactag
aggggtcaag gtcatgttta atggaaagaa 
agatctttat gtgaaagaca aattggatga 
gcttgcaaat gaaagatggg atgtttgtct 
cagctttgta aatagtattg caactacaaa 
tcaagttgtt ggtaaactga ttgaagtagt 
gaaaccattt caagtaaaaa accatatatg 
aacttttgat tctcagacta aggaaaacat 
atgccagctg tcagaaaaat tttttaaagc 
cctgaactgg gtgaaattta aggctcagac 
atacagtaaa atcaaaggta ttcccaaact 
ttccctggag tgtacactga tattaacaga 
tggattaggt gtgattggac gagacagata 
taatgtacgg gaagcttctc ataaacagat 
taaaatagtt ggtctacaat ataagaaaag 
acgctatgga aagattatga ttatgaccga 
cctgcttatt aatttcatcc atcacaattg 
agagttcatt actcctattg taaaggcaag 
tattcctgaa tttgacgaat ggaaaaaaca 
gtactataaa ggattgggta ctagtacagc 
ggaaaggcat cgcatcttgt ttagatatgc 
ggcatttagt aagaagaaga ttgatgacag 
ccggagacag cgtaggctac atggcttacc 
gcatttgact tataatgatt tcatcaacaa 
tgaaagatct ataccatctc ttgttgatgg 
tacctgtttc aagaggaatg ataaacgtga 
tgctgagatg tcggcttatc atcatggaga 
ggctcagaac tttgtgggaa gtaacaacat 
aactcggctt catggtggca aagatgctgc 
cactttagca aggctacttt ttcctgctgt 
tgataatcaa cgtgtagagc ctgagtggta 
tggtgctgag ggcattggta ctggatgggc 
aattgtgaac aatgtcagac gaatgctaga 
ctacaaaaac tttaaaggca cgattcaaga 
tgaaatattt gtagtggaca gaaacacagt 
gacacaggta tataaagaac aggttttaga 
agcattaatt tctgattata aagaatatca 
aatgactgaa gagaaactag cacaagcaga 
tcaaactact cttacttgta attccatggt 
atatgaaact gtgcaagaca ttctgaaaga 
tttacgtaag gagtggcttg tgggaatgtt 
agcccgtttc attttagaga agatacaagg 
agatttgatt caaatgttag tccagagagg 
agaagcacaa gaaaaggcag cagaagagga 
ctccgattca ggaactcctt caggcccaga 
gtctcttact aaagaaaaag ttgaagaact 
ggtcaatgat cttaaaagaa aatctccttc 
tgttgaagaa ctggataaag tggaatctca 
tggaaaagca attaaaggta aagttggcaa 
gacaatgccc tcaccttatg gcagaagaat
tgccagcaaa aagttgctga agaagaagaa 
ggaatttgat gaagaattca gtggagcacc 
tccatcagtt cctataaata aaggtcccaa 
cagagtgaga aaaacaccta catcatctgg 
gaatccttgg tcagatgatg aatccaagtc 
ggttattcca agagattctt tgcttaggag 
tgatttctca gaagaagagg atgatgatgc 
agaggaattg aaagttaaag catctcccat 
ttcagatggg ttagataaag atgaatatac
agaaaaatct ttgcatgaca aaaaaagtca 
atattctcag aagtcagaag atgattcagc 
ttctgttttt tcaccatcat ttggtctgaa 
agctgctaaa aagggaaaac cgtcttcaga 
acagaagaaa gtagtagagg ctgtaaactc 
tcgtaatggt 540
agcttgcaaa 600 
ttctgaagcc 660 
agatctgtcc 720 
aagggcatat 780 
attgcctgta 840 
aactggggtg 900 
cacattgagt 960 
aggtggacgg 1020 
taagaaaaag 1080 
ggtttttatt 1140 
gactctgcag 1200 
agcctctaat 1260 
tcagctgaat 1320 
ggatgatgct 1380 
gggagactct 1440 
cggagttttt 1500 
catggaaaat 1560 
ttacgatgat 1620 
tcaggatcaa 1680 
gccatcactt 1740 
caaaaataag 1800 
tatagaaaac 1860 
taaagaagca 1920 
tggtcctgaa 1980 
aaaagaatgg 2040 
agagcaattt 2100 
ggaattgatt 2160 
ctttaaacct 2220 
agtaaaagtt 2280 
acaagcattg 2340 
taacttgctt 2400 
aagccctcgt 2460 
ggatgacaac 2520 
tattcctata 2580 
ttgtaaacta 2640 
tggcctggat 2700 
acttggtcaa 2760 
agaaattaca 2820 
acctatgcta 2880 
tactgacaca 2940 
agctgctgga 3000 
actttttgat 3060 
attctttgat 3120 
gggagcagaa 3180 
gaaaattact 3240 
ttatgaatct 3300 
tgaaacacaa 3360 
ttttaattat 3420 
gattaaacag 3480 
agatctttgg 3540 
agaacgagaa 3600 
acctaaggtg 3660 
aattcctgaa 3720 
gggtgatctt 3780 
agtagaaggt 3840 
acctaagagg 3900 
taaacctagt 3960 
agaaagtgat 4020 
agcagcagcc 4080 
tgatgatgat 4140 
aacaaatgat 4200 
attttcacca 4260 
ggattttgga 4320 
taaatttgac 4380 
acagacagat 4440 
tacagtccct 4500 
tgactcggat 4560 






tcagaatttg 
aaaaggaaag 
aaaacaacaa 
ttcccctcag 
gaagtaaaat 
aattaagtgc 
ttctctgtct 
tttttartat 
tttttactca 
cgtgcattag 
gaaatttaaa 



gcattccaaa 
catctggctc 
gcaagaaacc 
acttccctac 
attttgcaga 
ccaaagagca 
cagacttttg 
tgtggtaggc 
ttgaaatgtc 
cacagatttt 
cattaaaaat 



gaagactaca acaccaaaag gtaaaggccg
tgaaaatgaa ggcgattata accctggcag 
gaagaagaca tcttttgatc aggattcaga
tgagccacct tctctgccac gaaccggtcg 
gtctgatgaa gaagaagatg atgttgattt 
caaacatttt tcaacaaata tcttgtgttg 
tacatctggc ttattttaat gtgatgatgt 
cttttaacat tttgtuctta cacatacagt 
acgtactgtc tgattggctt gtagaattgt 
aattgtcatg gttacaaact acagacctgc 
ggaactgtg 



aggggcaaag 
gaaaacatcc 
tgtggacatc 
ggctaggaaa 
tgcaatgttt 
tccttttgtc 
aattgacggt 
tttatgctct 
tatagactgc 
tttttgaaat 



4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5189 



<210> 350 
<211> 1536 
<212> DNA 

<213> Homo sapiens 



<400> 350 

gggggggggg

cccgccgcgc 
tcggagctgc 
ccgccgcacg 
aggaaggacg 
agcctgagag 
gaggagttgc 
gtgaaaatct 
agagaagaag 
tacagagata 
gacaccatca 
gatcttcctc 
agtgagctgt 
aacatcgcca 
ggtgacttta 
aaaattcagc 
gagaaaattg 
actattaaaa 
gtctttaggg 
aactaggtca 
ggcaaatgta 
gtatctgaca 
caaaaacatg 
aagctatttt 
ctgagggagc 
tgttttatat 



ggaccacttg 
cacttcgcct 
cgcgccggcc 
gggagctgca 
accgcacggg 
atgaattccc 
tgtggtttat 
gggatgccaa 
gggacttggg 
tggaatcaga 
aaaccaaccc 
tgatggcgct 
cctgccagct 
gctacgccct 
tacacacttt 
ttcagcgaga 
atgacttcaa 
tggaaatggc 
gttgggctgg 
aaaatctgtc 
actgtgccag 
atgctgaggt 
tatgtgcatt 
tggaatattt 
tgagtaacac 
gttgctataa 



gcctgcctcc 
gcctccgtcc 
cttgcccccc 
gtacctgggg 
caccggcacc 
tctgctgaca 
caagggatcc 
tggatcccga 
cccagtttat 
ttattcagga 
tgacgacaga 
gcctccatgc 
gtaccagaga 
gctcacgtac 
gggagatgca 
acccagacct 
agctgaagac 
tgtttagggt 
atgccgaggt 
cgtgacctat 
ttctttccat 
tatgaacaaa 
tcaatcccac 
ttagaatatt 
catcgatcat 
taaagaagtg 



gtcccgccgc 
cccgcccgcc 
gccgcacagg 
cagatccaac 
ctgtcggtat 
accaaacgtg 
acaaatgcta 
gactttttgg 
ggcttccagt 
cagggagttg 
agaatcatca 
catgccctct 
tcgggagaca 
atgattgcgc 
catatttacc 
ttcccaaagc 
tttcagattg 
gctttcaaag 
aaaagttctt 
cagttattaa 
aataaaaggc 
gtgaggagaa 
gtacttataa 
ttaagaattt 
gatgtagagt 
ttctgc 



gccacttggc 
gcgccatgcc 
agcgggacgc 
acatcctccg 
tcggcatgca
tgttctggaa 
aagagctgtc 
acagcctggg 
ggaggcattt 
accaactgca 
tgtgcgcttg 
gccagttcta 
tgggcctcgg 
acatcacggg 
tgaatcacat 
tcaggattct 
aagggtacaa 
gagcttgaag 
tttgctctaa 
tttttaagga 
tttgagttaa 
tgaaatgtat 
agaaggttgg 
cacaagctat 
gtggttatga 



ctgcctccgt 
tgtggccggc 
cgagccgcgt 
ctgcggcgtc 
ggcgcgctac 
gggtgttttg 
ttccaaggga 
attctccacc 
tggggcagaa 
aagagtgatt 
gaatccaaga 
tgtggtgaao 
tgtgcctttc 
cctgaagcca 
cgagccactg 
tcgaaaagtt 
tccgcatcca 
gatattgtca 
aagaaaaagg 
tgttgccact 
ctcactgagg 
gtgctcttag 
tgaatttcac 
tccctcaaat 
actttatagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1536 



<210> 351 
<211> 2386 
<212> DNA 

<213> Homo sapiens 



<400> 351 

ggaggaggaa 

gggctcccag 

accgcgaggc 

acaaaagatc 

ttctctccct 

acgagactca 

ataaaaatgt 

cagaaaggat 

gaagggactg 

ctggcagaca 

gaggtgatcc 

ctgcggtacc 

gcatatcaga 

gagagccgaa 

ggcgtggttc 



gcaagcgagg 
cctgccgggt 
cggctgagtt 
ttccgggggc 
tcagaatctt 
gtgagtgagc 
ctcagagcaa 
acagctggag 
aatcggagat 
gccccgcggt 
ccatggcagc 
ggcgggcatt 
gctttgaaca 

agggccagga

tgctgggctc 



gggctggttc ctgagcttcg caattcctgt 
cgcatgatcc ctccggccgg agctggtttt 
accggcatcc ccgcagccac ctcctctccc 
tgcacctgcc tgcctttgcc taaggcggat 
atcttggctt tggatcttag aagagaatca 
aggtgttttg gacaanggac tggttgagcc 
ccgggagctg gtggttgact ttctctccta 
tcagtttagt gatgtggaag agaacaggac 
ggagaccccc agtgccatca atggcaaccc
gaatggagcc actggccaca gcagcagutt 
agtaaagcaa gcgctgaggg aggcaggcga
cagtgacctg acatcccagc tccacatcac 
ggatactttt gtggaactct atgggaacaa
acgcttcaac cgctggttcc tgacgggcat 
actcttcagt cggaaatgac cagacactga 
gtcgccttct 60
tttgccagcc 120 
gacctgtgat 180 
ttgaatctct 240 
ctaaccagag 300 
catccctatt 360 
caagctttcc 420 
tgaggcccca 480 
atcctggcac 540 
ggatgcccgg 600 
cgagtttgaa 660 
cccagggaca 720 
tgcagcagcc 780 
gactgtggcc 840 
ccatccactc 900 






taccctccca

gagaaccact 

tcccttgcag 

cccttggcct 

tggaagggtt 

tagcctccct 

agaaaggaga 

ccccactctc 

atttaggggc 

gactccctcc

aagcctctgg 

ccccatgcca 

ctggctccca 

ttatggcctc 

gctcctcatg 

ctatgggagc 

gcatggggtg 

ccccatgcct 

ctcttcaggc 

cagccacccc 

gatatactat 

gcgcgtgcac 

cctggtccag 

acccccaccc 

ttcagtaata 



cccccttctc 

acatgcagcc 

ctagttttct 

cagaattcac 

tgtggctggg 

gaaaatgttt 

ctagattgcc 

cccacactaa 

cacttttgac 

tcacctctgg 

ctagaggcca 

aagctagggt 

tgaccatact 

agctgcctca 

gtgggttcag 

cccagggtct 

gggtagggct 

gtccagcctg 

ctccctctct 

gggctctctg 

ttttgttaag 

gtggagagct 

ggaagctggc 

cacacttgtt 

aactgtgttt 



tgctccacca catcctccgt ccagccgcca 
catgcccacc tgcccatcac agggttgggc 
agaatttatc acacttctgt gagaccccca 
aaaatttcca caaaatctgt ccaaaggagg 
ggcaggaggg ccctacctga ttggtgcaac 
ttctgccagg gagcttgaaa gttttcagaa 
tttgttttga tgtttgtggc ctcagaattg 
cctgggttcc ctttccttcc atccctaccc 
tagggattca ggctgcttgg gataaagatg 
actggctaga gtcctcactc ccagtccaaa 
gccccaccca ggagggaggg ggctatagct 
ggcccttgca gttcagcacc accctagtcc 
gagggaccaa ctgggcccaa gacagatgcc 
cttcctacaa gagcagcctg tggcatcttt 
gggactcagc cctgaggtga aagggagcta 
tccctacctc aggcaggaag ggcaggaagg 
gactagaagg gccagtcctg cctggccagg 
ggcagccagg ctgccaaggc cagagtggcc 
cttctgctcc acccttggcc tgtctcatcc 
ctgtacatat ttgagactag tttttattcc 
cgtgtctgta tttatgtgtg aggagctgct 
ggtgcccgga gattggacgg cctgatgctc 
cgagggtcct ggctcctgag gggcatctgc 
ccagctctrt gaaatagtct gtgtgaaggt 
actcagtgaa aaaaaaaaaa aaaaaa 



ttgccaccag 

ccagatctgg 

cacctcagtt 

ctggcaggta 

ccttacccct 

cctcttcccc 

atcattttcc 

cctaagagcc 

caaggaccag 

tgtccrccag 

acaggaagca 

cttcccctcc 

ccagagctgt 

gccttgggct 

tcaggaacag 

agagcctgct 

cagatctgtg 

tggccaggag 

ccaggggtcc 

ttgtgaagat 

ggcttgcagt 

cctcccctgc 

ccctccccca 

gaaagtgcag 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2386 



<210> 352 
<211> 1270 
<212> DNA 

<213> Homo sapiens 



<400> 352 

agacgttcgc 

gcccgccgcc 

ctcttccaac 

agaccgagac 

gtggtgctgt 

gacacctact 

gttacctcaa 

tctagtagtg 

ccacaaatac 

ccccaaaaca 

ggcaaaaatg 

caaaatttct 

aaatgtcagc 

atgattgaca 

gtgatcctca 

tatgaaggag 

atagcataac 

cataaatgat 

tgatggtttt 

aaggtaaaaa 

ttattttaca 

aaaaaaaaaa 



acacctgggt 
ccgagctccc 
acccgctcgt 
gccgccgctg 
cgctcttgat 
ctgggaagcg 
gaagtgagat 
aaccgtcctc 
ctggctatat 
agacggaaag 
gaaaaaatag 
gcattcacgg 
aagaatattt 
gtagtttatc 
cagctgttgc 
aagctgagga 
tgaagataaa 
gagtcggtcc 
aaactttcaa 
gtattttttc 
gctcattaaa 



gccagcgccc 
caagccttcg 
tttggcggca 
cgaaggacca 
actcggctca 
tgaaccattt 
gtcttcaggg 
gggagccgac 
tgtcgatgat 
tgaaaatact 
aagaaacaga 
agaatgcaaa 
cggtgaacgg 
aaaaattgca 
tgttattaca 
acgaaagaaa 
attacaggat 
tctttccagt 
ttgtcacttt 
aagttgtaaa 
cttttttaac 



cagaggtccc 
agagcggcgc 
gctcgtgtcc 
atgagagccc 
ggccattatg 
tctggggacc 
agtgagattt 
tatgactact 
tcagtcagag 
tcagataaac 
aagaagaaaa 
tatatagagc 
tgtggggaaa 
ttagcagcca 
gtccagctta 
cttcgacaag 
atcacattgg 
ggatcataag 
ttatgctatt 
taatttattt 
caaacagaaa 



gggacagccc 

acactcccgg 

cagagaccga 

cgctgctacc 

ctgctggatt 

acagtgctga 

cccctgtgag 

cagaagagta 

ttgaacaggt 

ccaaaagaaa 

atccatgtaa 

acctggaagc 

agtccatgaa 

tagctgcctt 

gaagacaata 

agaatggaaa 

agtcactgcc 

acaatggacc 

tctgtatata 

aatatttaat 

aaaaaaaaaa 



gaggcgccgc 

tctccactcg 

gttgccccag 

gccggcgccg 

ggacctcaat 

tggatttgag 

tgaaatgcct 

tgataacgaa 

agttaagccc 

gaaaaaggga 

tgcagaattt 

agtaacatgc 

aactcacagc 

tatgtctgct 

cgtcaggaaa 

tgtacatgct 

aagtcatagc 

ctttttgtta 

aaggtgcacg 

ggaagtgtat 

aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1270 



<210> 353 

<211> 1600 

<212> DNA 

<213> Homo sapiens 



<400> 353 

gccccgccgc 

gggcgaggcc 

caggcccgga 

tgacttctcc 

cccggagctg 

cttcgtggcc 

ggccatccga 



cggcagtgga 
gggtacctgg 
gcgatggcag 
ggggaaggga 
atccgcatga 
gctgtggtga 
cttcggggca 



ccgctgtgcg cgaaccctga accctacggt 
gctgggatcc ggagcaagcg ggcgagggca 
ccttgatgac cccgggaacc ggggccccac 
gccagggact tcccgaccct tcgccagagc 
agcgagacgg aggccgcctg agcgaagcgg 
atgggagcgc gcagggcgca cagatcgggg 
tggatctgga ggagacctcg gtgctgaccc 
cccgacccgc 60
gcgccctaag 120 
ccgcgcctgg 180 
ccaagcagct 240 
acatcagggg 300 
ccatgctgat 360 
aggccctggc 420 






tcagtcggga 

ttccacaggg 

tggctgcaag 

taagctggag 

gctggaccag 

aatcctatat 

ctccattctc 

cggaggggcc 

tggcgtggga 

cctgggtcgc 

cgcaggcccg 

cagcggacac 

cggctcggcc 

ggcccgagcc 

ggagcaggag 

gctggcgctg 

cctgggggtg 

ctggctccgc 

ggaggcgctc 

cgttctgccg 



cagcagctgg 
ggtgtgggtg 
gtgccaatga 
tctattcctg 
gcgggctgct 
gcagccagag 
agtaagaaac 
gccgtcttcc 
gccagcctag 
tgcgtgggcc 
ccagacttaa 
gcggggactc 
cttggccgct 
ctgtgctcgg 
gagctgctgg 
gtgctgcacg 
ggcgcagagc 
gtgcaccggg 
gtactctccg 
ccgcagcaat 



agtggccaga ggcctggcgc cagcagcttg
acaaggtcag cctggtcctc gcacctgccc 
tcagcggacg tggtctgggg cacacaggag 
gattcaatgt catccagagc ccagagcaga 
gtatcgtggg tcagagtgag cagctggttc 
atgtgacagc caccgtggac agcctgccac 
tcgtggaggg gctgtccgct ctggtggtgg 
ccaaccagga gcaggcccgg gagctggcaa 
ggcttcgggt cgcggcagcg ctgaccgcca 
acgccctgga ggtggaggag gcgctgctct 
gggacctggt caccacgctc gggggcgccc
aggctcaggg cgctgcccgg gtggccgcgg 
tcgagcggat gctggcggcg cagggcgtgg 
gaagtcccgc agaacgccgg cagctgctgc 
cgcccgcaga tggcaccgtg gagctggtcc 
agctcggggc cgggcgcagc cgcgctgggg 
tgctggtcga cgtgggtcag aggctgcgcc 
acggccccgc gctcagcggc ccgcagagcc 
accgcgcgcc attcgccgcc ccctcgccct 
aaagctcctt tgccgcgaaa 



tggacaagca 480 
tggcggcatg 540 
gcaccttgga 600 
tgcaagtgct 660 
ctgcggacgg 720 
tcatcacagc 780 
acgttaagtt 840 
agacgctggt 900 
tggacaagcc 960 
gcatggacgg 1020 
tgctctggct 1080 
cgctggacga 1140 
atcccggtct 1200 
ctcgcgcccg 1260 
gggcgctgcc 1320 
agccgctccg 1380 
gtgggacccc 1440 
gcgccctgca 1500 
tcgcagagct 1560 



<210> 354 
<211> 1842 
<212> DNA 

<213> Homo sapiens 



<400> 354 
cgatcagatc 
ccgactacag 
attaaacatc 
acttggcaag 
ctgtacaacc 
aaggatggta 
attacattga 
ctgtgcctta 
aatgttagag 
gctgttacac 
attggttatc 
tttgttgttt 
gatttgggag 
ttccatctta 
gtgttcattt 
agtacacatc 
attaccatta 
ttctatactt 
tggtttgaaa 
cattgtcttt 
ttgataatat 
ttttttcctc 
attaattttg 
caaatttatg 
tgtggattat 
ttttgtatct 
aaaaatggct 
ataaaatgtg 
ctttatagaa 
gtggtgtctt 
acaattttta 



gatctaagat 
aagaggagaa 
ccctacagaa 
caaacctgcg 
atatccagtt 
ttgagcctat 
acaaacagca 
ttggagaatc 
ctaaaggtga 
atatagggag 
agtcccacgc 
aagaagacac 
ctgaaccaaa 
atgttactca 
tttttttttt 
cccagaatcc 
attacataca 
tacaggaaaa 
gattatgatg 
gctgtactct 
tagtgattct 
atgtaacatt 
tgcattcttt 
catttcattc 
aaaaaatgta 
ttatgctgta 
caagtagaaa 
tacagtgaat 
ggaatttgta 
ccatgtgaac 
ataaattgaa 



ggcgactgtc 
aacggaatct 
cagatgggca 
gctgatctcc 
gtctagtaat 
gtgggaagat 
gagacgaagt 
ttttgatgac 
taaganagca 
ggtatacaag 
agacacagct 
cttctgagta 
gcctcttcaa 
gatataagag 
tttttggcta 
ataaatgtgt 
gattttacct 
aaaattctgt 
catacagttt 
tgctgatggc 
gatttcagtt 
ggtgaaggat 
ggtaattttt 
aaatcagtga 
acatcataat 
ttttaacact 
agcagtccca 
tgtcttttag 
cttcttattg 
agcataagtt 
taggtagtat 



gaaccggaaa 
aatcaggagg 
ctctggtttt 
aagtttgata 
ttaatgcctg 
gagaaaaaca 
gacctcgatc 
tacagtgatg 
atatggacta 
gaaaggttag 
actaagagcg 
ttctcatagg 
aaagcagagt 
aagtctcatt 
gagtttccac 
tcctggccca 
atccacaata 
tgttccattt 
tctagcaatt 
tgctagattt 
tttcatttgt 
ccaggaatat 
tttgtttttt 
tctatgtttg 
tacattccta 
ttgtattact 
ttcatattaa 
acaactagat 
caggcaagtc 
tggagcacta 
catatatatg 



ccacccctac 
ttgctaaccc 
ttaaaaatga 
ctgttgaaga 
gctgtgacta 
aacggggagg 
gcttttggct 
atgtatgtgg 
ctgaatgtga 
gacttcctcc 
gctccaccac 
agactgcgtc 
ggactgcatt 
cgcctttgtc 
tatcccaatc 
ctctgtaata 
gtcagaaaac 
tatgcagaag 
ttctttgttt 
taatttattt 
tttgcttaaa 
gacacaaagg 
gtaactacaa 
tgtgatttcc 
actagaatta 
taggttattt 
gacagtgtac 
ttgtcctttt 
tctatattat 
gtttgattat 
ga 



tcctaatccc 
agaacactat 
taaaagcaaa 
cttttgggct 
ctxacttttt 
acgatggcta 
agagacactt 
cgctgttgtt 
aaacagagaa 
aaagatagtg 
taaaaatagg 
aagcaatcga 
taaatttgat 
ttgtacttct 
aaagaattac 
gtucagtaga 
aacttggcat 
catattttgc 
ctttttacag 
gtttccctac 
tttttttttt 
tggaataaac 
agctttgcta 
taaacataat 
gtatgtctgt 
tgctttggtt 
aaaactgtaa 
atttctccat 
gtcctctttt 
tatgtttatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1842 



<210> 355 
<211> 4975 
<212> DNA 

<213> Homo sapiens 

111 SI- SES Kiwi ft 






ttcaccctct 
agcctggccc 
ggcctgagtg 
aggtgtgagg 
tccttcctgc 
tctactctac 
tttgccatct 
cgcttgactc 
ctttgtcaca 
gtggtgaagg 
tggggtcctg 
aatggtcact 
tgctcaggcc 
tgtgtacctc 
aatccccaca 
gtggtggatc 
aatgggctca 
ggctctggga 
accaagatcc 
aagatccctg 
ggttacctga 
ttgacaacca 
aacttgaatg 
tatataagtg 
cgggggccta 
gcagagggca 
ggtcagtgct 
tttctgaatg 
gaatgccaac 
caatgtgccc 
ggtgccaagg 
gagaactgca 
gtgctgatcg 
attttcatga 
agggctatga 
aaggctaaca 
cttggctcgg 
atcaagattc 
gctgtgacag 
ctgggactat 
ctgctggatc 
ggagtacaaa 
ctggctgccc 
gtggctgacc 
attaagtgga 
tggagctatg 
ctacgattgg 
atctgcacaa 
cgcccaacct 
tatctggtca 
ctgacaaaca 
ttggaagcag 
gttggaacac 
atgcccatga 
agtgaacggt 
gagtcatcag 
tgtagaagcc 
cagcgccaca 
gatgtcaacg 
ggcacccttt 
gaggagtatg 
agttcccttg 
ctgggcagca 
actccagatg 
gattatgcag 
tttcaggggc 
agcttagagg 
cccaaggcta 



gcggagtcat 

ggggctccga 

tgaccggcga 
tggtgatggg 
agtggattcg 
cattgcccaa 
tcgtcatgtt 
agctcaccga 
tggacacaat 
acaatggcag 
gatcagaaga 
gctttgggcc 
ctcaggacac 
gctgtccaca 
ccaagtatca 
aaacatcctg 
agatgtgtga 
gccgcttcca 
tgggcaacct 
ccctggaccc 
acatccagtc 
ttggaggcag 
tcacatctct 
ccaataggca 
cggaagagcg 
aagtgtgtga 
tgtcctgtcg 
gggagcctcg 
ccatgggggg 
attttcgaga 
gcccaatcta 
cccaggggtg 
gcaaaaccca 
tgctgggcgg 
ggcgatactt 
aagtcttggc 
gtgtctttgg 
cagtctgcat 
atcatatgct 
gcccagggtc 
atgtgagaca 
ttgccaaggg 
gaaacgtgct 
tgctgcctcc 
tggcccttga 
gtgtgacagt 
ctgaagtacc 
ttgatgtcta 
ttaaagaact 
taaagagaga 
agaagctaga 
aggaggacaa 
ttaatcggcc 
accagggtaa 
gcccccgtcc 
aggggcatgt 
ggagcaggag 
gtctgctgac 
gttatgtcat 
cttcagtggg 
aatacatgaa 
aggagctggg 
cacagagttg 
aagactatga 
ccatgggggc 
ctggacatca 
ctacagactc 
atgcccagag 



gagggcgaac gacgctctgc aggtgctggg 
ggtgggcaac tctcaggcag tgtgtcctgg 
tgctgagaac caataccaga cactgtacaa 
gaaccttgag attgtgctca cgggacacaa 
agaagtgaca ggctatgtcc tcgtggccat 
cctccgcgtg gtgcgaggga cccaggtcta 
gaactataac accaactcca gccacgctct 
gattctgtca gggggtgttt atattgagaa 
tgactggagg gacatcgtga gggaccgaga 
aagctgtccc ccctgtcatg aggtttgcaa 
ctgccagaca ttgaccaaga ccatctgtgc 
caaccccaac cagtgctgcc atgatgagtg 
agactgcttt gcctgccggc acttcaatga 
gcctcttgtc tacaacaagc taactttcca 
gtatggagga gtttgtgtag ccagctgtcc 
tgtcagggcc tgtcctcctg acaagatgga 
gccttgtggg ggactatgtc ccaaagcctg 
gactgtggac tcgagcaaca ttgatggatt 
ggactttctg atcaccggcc tcaatggaga 
agagaagctc aatgtcttcc ggacagtacg 
ctggccgccc cacatgcaca acttcagtgt 
aagcctctac aaccggggct tctcattgtt 
gggcttccga tccctgaagg aaattagtgc 
gctctgctac caccactctt tgaactggac 
actagacatc aagcataatc ggccgcgcag 
cccactgtgc tcctctgggg gatgctgggg 
aaattatagc cgaggaggtg tctgtgtgac 
agaatttgcc catgaggccg aatgcttctc 
cactgccaca tgcaatggct cgggctctga 
tgggccccac tgtgtgagca gctgccccca 
caagtaccca gatgttcaga atgaatgtcg 
taaaggacca gagcttcaag actgtttagg 
tctgacaatg gctttgacag tgatagcagg 
cacttttctc tactggcgtg ggcgccggat 
ggaacggggt gagagcatag agcctctgga 
cagaatcttc aaagagacag agctaaggaa 
aactgtgcac aaaggagtgt ggatccctga 
taaagtcatt gaggacaaga gtggacggca 
ggccattggc agcctggacc atgcccacat 
atctctgcag cttgtcactc aatatttgcc 
acaccggggg gcactggggc cacagctgct 
aatgtactac cttgaggaac atggtatggt 
actcaagtca cccagtcagg ttcaggtggc 
tgatgataag cagctgctat acagtgaggc 
gagtatccac tttgggaaat acacacacca 
ttgggagttg atgaccttcg gggcagagcc 
agacctgcta gagaaggggg agcggttggc 
catggtgatg gtcaagtgtt ggatgattga 
agccaatgag ttcaccagga tggcccgaga 
gagtgggcct ggaatagccc ctgggccaga 
ggaagtagag ctggagccag aactagacct 
cctggcaacc accacactgg gctccgccct 
acgtgggagc cagagccttt taagtccatc 
tcttgggggg tcttgccagg agtctgcagt 
agtctctcta cacccaatgc cacggggatg 
aacaggctct gaggctgagc tccaggagaa 
ccggagccca cggccacgcg gagatagcgc 
tcctgttacc ccactctccc cacccgggtt 
gccagataca cacctcaaag gtactccctc 
tctcagttct gtcctgggta ctgaagaaga 
ccggaggaga aggcacagtc cacctcatcc 
ttatgagtac atggatgtgg ggtcagacct 
cccactccac cctgtaccca tcatgcccac 
atatatgaat cggcaacgag atggaggtgg 
ctgcccagca tctgagcaag ggtatgaaga 
ggccccccat gtccattatg cccgcctaaa 
tgcctttgat aaccctgatt actggcatag 
aacgtaactc ctgctccctg tggcactcag 



cttgcttttc 
gactctgaat 
gctctacgag 
tgccgacctc 
gaatgaattc 
cgatgggaag 
gcgccagctc 
gaacgataag 
tgctgagata 
ggggcgatgc 
tcctcagtgt 
tgccgggggc 
cagtggagcc 
gctggaaccc 
ccataacttt 
agtagataaa 
tgagggaaca 
tgtgaactgc 
cccctggcac 
ggagatcaca 
tttttccaat 
gatcatgaag 
tgggcgtatc 
caaggtgctt 
agactgcgtg 
cccaggccct 
ccactgcaac 
ctgccacccg 
tacttgtgct 
tggagtccta 
gccctgccat 
acaaacactg 
attggtagtg 
tcagaataaa 
ccccagtgag 
gcttaaagtg 
gggtgaatca 
gagttttcaa 
tgtaaggctg 
tctgggttct 
gctcaactgg 
gcatagaaac 
agattttggt 
caagactcca 
gagtgatgtc 
ctatgcaggg 
acagccccag 
tgagaacatt 
cccaccacgg 
gccccatggt 
agacctagac 
cagcctacca 
atctggatac 
ttctgggagc 
cctggcatca 
agtgtcaatg 
ctaccattcc 
agaggaagag 
ctcccgggaa 
agatgaagat 
ccctaggcca 
cagtgcctct 
tgcaggcaca 
tcctgggggt 
gatgagagct 
aactctacgt 
caggctttxc 
ggagcattta 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 






atggcagcta 
ccagcccctt 
ttgacacaaa 
aggaaaggtt 
cactcctgga 
tggaactagg 
aggaaaccta 
agaagcttaa 
agcacttaac 
tttatcatcc 
tcgggcatgg 
acctgaggca 



gtgcctttag 
ttccccagtc 
attcttatgg 
ttccttattt 
gatatgaagg 
ctcttatgtg 
gcagaggaaa 
aatctgtgaa 
tatgagccag 
ttaaaacaat 
tggctcargc 
aggagtttga 



agggtaccgt 
ccagacaatt 
tatgtagcca 
tgtgtgcttt 
attactctcc 
tgcctttgtt 
gtgtaatttt 
gaaagaggtt 
gcatcatact
tctgtgacat 
ctgtaatctc 
gaccagctta 



cttctcccta ttccctctct
ccattcaatc tttggaggct 
gctgtgcact ttcttctctt 
cccagtccca ttcctcagct 
atatcccttc ctctcaggct 
tcccatcaga ctgtcaagaa 
ggtttatgac tcttaacccc 
aggagtagat attgattact 
aaacttcacc tacattatct 
acatattatc tcattttaca 
agcactttgg gaggctgagg 
gccaacatag taagaccccc 



ctcccaggtc 
tttaaacatt 
tcccaacccc 
tcttcacagg 
cttgactact 
gaggaaaggg 
ctagaaagac 
atcataattc 
cacttagtcc 
caaagggaag 
cagaaggatt 
atctc 



4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4975 



<210> 356 

<211> 4627 

<212> DNA

<213> Homo sapiens 



<400> 356 
tcacttgcct 
tgacagccgc 
tccgcagccg 
gaggatggag 
tcttctacag 
tgataactgc 
aacaaagtgt 
ggacatgagt 
cttcttttta 
tattattttg 
aaatcgaaaa 
gttgccgcaa 
taatattaat 
cactgtattt 
ttgctaatgt 
aagtaatttc 
agtgcttaga 
ttctgtaagc 
aatcagtgat 
atatattaaa 
acactacaca 
agctataatt 
acaggtacta 
ttctgtaatc 
atattacaac 
cttgaagcct 
tttgctgaat 
aatatatgaa 
tttattcaag 
agttatgtat 
tggtgtattt 
cacaccttcc 
trtttggaag 
aggagtattt 
ggaggctcct 
tcagggagga 
ataataagcc 
catgcattat 
ttactattag 
tatatatatt 
gtcatttggt 
aattagatct 
agaacacatt 
gattcaacca 
attataacaa 
caggttggag 
gatcctcctg 
ctaatttttg 
ctcctaacct 



gatatrtcca 
tctccagcca 
ccctccgcca 
atgctcrgtg 
gcagtcctca 
acagctttag 
agctctgaca 
caaaactact 
accgtccacc 
tttcttatca 
agtaaagaac 
gtctgaatgg 
attccatttt 
taatgtactt 
ataatgtgca 
ctgagctaaa 
agtaagcaat 
ctatatatat 
aattggtttg 
acaataatgc 
tttgaaacaa 
ctgttatatg 
gtcactcatc 
ttttatttaa 
atttatgtga 
aaatttgtgc 
tctaacatta 
gacaataatt 
gaagttttaa 
ttaaagttgt 
ttccaaatga 
tgtggaggct 
agacaaagat 
ttgggctatt 
tcatcgaatg 
tctgttcttc 
atggctgacc 
ataccctggt 
gaaaataaac 
catatacaaa 
gctctgctaa 
aaacagttat 
ttaaatactc 
accaaggacg 
ctatttactt 
tgcagtggca 
cctcagcctc 
tatttttagt 
tgagatccac 



gtgtcagagg 
ctgccgcgag 
agccccagcg 
ccggcagggt 
gtacaactgt 
ttcagacaga 
tgaatggcta 
gcaggtgtga 
aacctttaag 
cagtcgtcgg 
caaagaagga 
cgccatcaaa 
attaataata 
gaaaaatgtt 
gaaaatattt 
tgcttcattg 
tcccaggtca 
agtcaaatcg 
actgtaccat 
aattagaatt 
gctaaccctg 
aggggcagtg 
taccagattc 
gtagtgggca 
ggtaattatt 
tttttaagaa 
aatcacagcc 
ctacatgttg 
ctttaataca 
atcttgacac 
aaaatctcaa 
gagatgaaaa 
ttcttctgca 
gcataaggag 
ctaaaccttt 
ctctacgttt 
tctggagcac 
gcaatcacac 
agacaaaaac 
catgtatgta 
ccatttatat 
ttctgtttcc 
tacttacagt 
gaaaatgctt 
tttttttttt 
cgatctcggc 
ctgagcagct 
agaggcgggg 
cctccacagc 



gacacagcca acgtggggtc 
cccgtctgct cccgccctgc 



cccgctccca tcgccgatga 
ccctgcgctg ctgctctgcc 
gattccatca tgtatcccag 
agacaatcca cgtgtggctc 
ttgtttgcat ggacagtgca 
agtgggttat actggtgtcc 
caaagagtat gtggctttga 
ttccacatat tatttctgca 



atatgagaga gttacctcag 
cttatgggca gggataacag 
tttatgttgg gtcaagtgtt 
tttatttttg ttttattttt 
aatatcaaaa gaaaattgat 
aaagcttcaa agtttatatg 
tagctcaaga attgttagca 
atttagtaag tatgtttttt 
ggtttgatat gtagttggca 
tgggagaagc aaatataggt 
gggagtctat ggtctcttca 
gacagttccc tatgccaact 
tgcctatgta aaatgaattg 
tttcatagct tcacaatgtt 
gctcaacaga caattagaaa 
tatttttaga ctatttcttt 
caaaatttga tggactaatt 
tcttaagatg gaaatacagt 
gctcagtaaa tggcttcttc 
aggaaatggg aaaaaactta 
ttgaaagctt ttaaaatgta 
ctagggctca ttttcctgac 
ctctgagccc ataggtctca 
ccactgctgc caccactttt 
gagtagagtc tccctggatc 
atcctggcat gtgctagggt 
caggtgccag gacttgtctc 
gactgtcatc taaagtcctg 
aagtaaatat atatggtcct 
tacatgacct taatggatca 
aaaacttaaa aacaagagaa 
tatttaatat agctgaagtc 
tggccctctg tggttagttc 
aaaaaataat acaacaacaa 
ctttttgaga tggagtctcg 
tcactgcaac ctcacctccc 
gggactacag gcgcatgcca 
tttcaccatg ttggccagga 
ctcccaaact gctgggatta 



ccttctaggc 
ccgtgcactc 
ccgcggggag 
tgggtttcca 
gagagtccag 
aagtgtcaat 
tctatctggt 
gatgtgaaca 
ccgtgattct 
gatggtacag 
gggatccaga 
tgtgcctggt 
aggtcaataa 
gacagactat 
atttttatac 
ccnggtgcac 
aatgacagat 
atgttcctca 
ccatggtatc 
cctgtgttaa 
ctcaggtctc 
cacgactcct 
aaaaacaatt 
ccttttttgt 
aaagtccaca 
ttataggggc 
attattttaa 
tatttcatct 
tagaatgtaa 
aaaattaata 
gaaacttaaa 
atttgtttat 
gagagttaat 
ggattttatg 
acataccagg 
aaacgaaggc 
catgtgtatc 
gccctggccc 
atacatattg 
tagaattgca 
aagaaaaatc 
aaaatatgta 
cacatctgtg 
caaaaaatac 
ctctgttgcc 
gggttcaaga 
ccatgcccag 
tggtctcaat 
caggcgtgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 






ccaccgcacg 

aggaggatgt 

tctgtgcccg 

caaggtacaa 

tgatcttggc 

atttcattta 

gcatgtcaat 

aagctaatcc 

cacattactt 

gagtaaacat 

ttagttcaca 

aaggggctat 

aatttatggt 

atattaagtt 

taaccttaaa 

tgaatatttt 

tggtatgact 

accaccaggt 

gttaaattca 

ccaatatcca 

tgaccgttaa 

tatttctggt 

tcaaatttga 

ttgcactttt 

aacataaagt

ttactgaatt 

gatattgaag 

gggaattgaa 

cactgac 



tagcatttac 
gaataggtta 
attttagtat 
ctgtatttat 
tgtttcaaaa 
caaaagattt 
tcataaaaac 
ctaaaatagt 
tttatgggtg 
aaaaactgat 
cttagtctaa 
gcatattcaa 
cttcaaccaa 
ctataactag 
attgatatta 
ttgctctaat 
acgtcttaag 
atcaaatcaa 
gtaatgcagt 
ttctgtagac 
aaaattcttg 
gtatatttta 
tgatgttaaa 
taagtttgaa 
tgtattgcat
gtatcaattt 
gagaaaatag 
aaaaattgaa 



attaggtatt acaagtaatg taaagatgat ttaagtatac 
tatgcaagca ctatgccctt ttatataagt gacttgaaca 
gtgcaggggg gcgatctggg aatcagtccc ctgtggatac 
taacgcttac tagatgtgag gagagtctga atattttcag 
aaatctattg acttttcaat aaatcagctg caatccattt 
attgtaagcc tctcaatctt ggtttttcag ttgatcttaa 
aagtcatttt tgtatttttc atctttaaga atgcttaaaa 
tagatctttg taaatgcata ttaaataata aagtatgacc 
aaaataagac aaaaataata gttttagtga ggatggtgct 
ttgctctcag ctgatgtgtG ctgtacacag tgggaagatt 
ctcccccatt ttacagattt ctcactatat atatttctag 
tgtattgaga accaaagcaa ccacaaatgc ataaatgcat 
ggccacataa taacccagtt aacttactct ttaaccagga 
tactcaaggt ttaaccttaa aattaagatt tccttaacct 
tattaaacat acataataca atgtaactcc actgttctcc 
ctctctgccg aaagtcaaag tgatgggaga attggtatac 
tcagattttt atttatgagt ctttgagact aaattcaatc 
cttttatgca gcaaatatat gattctagtg tctgactttt 
ttttaaaaac ctgtatctga cccactttgt aatttttgct 
ttttgaaaaa aaagttttta atttgatgcc caatatattc 
ttcatatggg agaaggggga gtaatgactt gtacaaacag 
atgtttttaa aaagagtaat ttcatttaaa tatctgttat 
tgtaatataa tgtattttct ttttattttg cactctgtaa 
gagccatttt ggtaaacggt ttttattaaa gatgctatgg 
gcaatttaaa gtaacttatt tgactatgaa tattatcgga 
gtttgtgttc aatatcagct ttgataattg tgtaccttaa 
ataatttaca agatattatt aatttttatt tatttttctt 
ataciataaaa atgcattgaa catcttgcat tcaaaatctt 



3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4627 



<210> 357 
<211> 2634 
<212> DNA 

<213> Homo sapiens 



<400> 357 

ggcacgaggc 

aggaggagga 

tgagctggga 

tcctgcacat 

aggtggaata 

ctggggagaa 

ctaccctttg 

acggaggaac 

aggctacttc 

taggctgtcc 

ccaagtccct 

caagaaatat 

acaagaatac 

cttctaagcc 

tccaggtgac 

ctactatctg 

tgtcagacat 

aggagcgagg 

actcaataga 

tagataaaga 

atgttgctca 

atgatgctaa 

gatttaagcc 

aggtgcaatt 

gagtgatcct 

tgaaggtagc 

tttgcaaagg 

tcgctgcaga 

tgtttcctgg 

acaccagatg 

taatgacagt 

acagtgtcat 



tgagtgtccg tctcgcgccc ggaagcgggc gaccgccgtc 
ggaggaggag gagggggcgg ccatggggct gctgtcccag 
ggaaaccaag cgccatgccg accacgtgcg gcggcacggg 
ctaccacgcc gtcaaggacc ggcacaagga cgttctcaag 
catgttggta tcttttgatc atgaaaataa aaaagtccgg 
agttcttgaa actctgcaag agaaggggga aaggacaaac 
gagaccagag tatgggagtt acatgattga agggacacca 
aatgtccgag ttcaatacag ttgaggccaa catgcgaaaa 
tatattagaa gaaaatcagg ctctrtgcac aataacttca 
tgggttcaca ctgcccgagg tcaaacccaa cccagtggaa 
cttctttcca gatgaagcaa taaacaagca ccctcgcttc 
ccgacatagg agaggagaaa aggttgtcat caatgtacca 
accatctcca tttatagaaa catttactga ggatgatgaa 
ggatcatatt tacatggatg ccatgggatt tggaatgggc 
attccaagcc tgcagtatat ctgaggccag atacctttat 
tccaattgtt atggctttga gtgctgcatc tcccttttac 
tgattgtcgc tggggagtga tttctgcatc tgtagatgat 
actggagcca ttgaagaaca ataactatag gatcagtaaa 
cagctattta tctaagtgtg gtgagaaata taatgacatc 
gatctacgaa cagctgttgc aggaaggcat tgatcatctc 
tctctttatt agagacccac tgacactgtt tgaagagaaa 
tgagtctgac cattttgaga atattcagtc cacaaattgg 
ccctcctcca aactcagaca ttggatggag agtagaattt 
aacagacttt gagaactctg cctatgtggt gtttgtggta 
ttcctacaaa ttggattttc tcattccact gtcaaaggtt 
acagaaaaga gatgctgtct tgcagggaat gttttatttc 
tggcaatgca gtggtggatg gttgtggcaa ggcccagaac 
ggagtacacc ctcatgagca tagacaccat catcaatggg 
actgatccca attctgaact cttaccttga aaacatggaa 
tagtattctg aactacctaa agctaattaa gaagagagca 
tgccagatgg atgagggagt ttatcgcaaa ccatcctgac 
aactqatgaa atgaattata gccttatttt gaagtgtaac 
agcccggagg 60
ggctcgccgc 120 
atcctccagt 180 
tggggcgatg 240 
ttggtcctgt 300
ccaaaccatc 360 
ggacagccct 420 
cgccggaagg 480 
tttcccagat 540 
ggaggagctt 600 
agtaccttaa 660 
atatttaagg 720 
gcttcaaggg 780 
aattgctgtc 840 
gatcagttgg 900 
cgaggctatg 960 
agaactcggg 1020 
tcccgatatg 1080 
gacttgacga 1140 
ctggcccagc 1200 
atacacctgg 1260 
cagacaatga 1320 
cgacccatgg 1380 
ctgctcacca 1440 
gatgagaaca 1500 
aggaaagata 1560 
agcacggagc 1620 
aaggaaggtg 1680 
gtggatgtgg 1740 
tctggagaac 1800 
tacaagcaag 1860 
caaattgcaa 1920 






atgaattatg 

gaagtaaaac 

actggctaca 

ctgtactgtt 

tctagagttt 

aataacatat 

caacctactg 

gagttaaaat 

tgaaattttt 

tgtctacatt 

gcaccttatc 

gcrttcccct 



tgaatgccca 
tgactcatcc 
gtaccatgcc 
ttctgggcca 
atacagtgta 
ctaaagtcat 
tctaagcagt 
gtttactgta 
ctctttaaaa 
aaatcacttg 
tatgatgttt 
ctgaataaat 



gagttacttg 
aactagacat 
tctcagcccg 
gtgagccaga 
catgtacata 
catgaactgg 
tttgtaaatg 
aatttttgtt 
acattttctc 
aatccattga 
cttttgcaat 
acccattgaa 



gatcagcatt taggaaagta aaatatagtg 
tctacagaaa gaaaaatgca ttattgacga 
tgtqtataat atgaagacca aatgatagaa 
aattgattaa ggctttcttt ggtaggtaaa 
qtaaagtatt tttgattaac aatgtatttt 
cttgtacatt tttaaattct tactctggag 
tactggtaat tgtacaatac ttgcattcca 
cttttaaaga ctacctggga cctgatttat 
tcgttaattt tcctttgtca tttcctttgt 
aagtgcttca agggtaatct tgggtttcta 
tggaataatc acttggtcac cttgccccaa 
ctctgaaaaa aaaaaaaaaa aaaa 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2634 



<210> 358 
<211> 1246 
<212> DNA 

<213> Homo sapiens 



<400> 358 

gaceagccta 

cgcgcccccc 

aaaggaggca 

gccgggattg 

atcttcgagc 

atcggagccg 

gagtcccagt 

atagctgcgg 

ttttacaagg 

aaagccatcc 

tcagacatct

gccatcaaag 

gccgtggtca 

aaccgcgaga 

ttggtgggat 

ccactaattt 

tcttttaatg 

ggtttatatt 

aaaggtacta 

tttaaataga 

ttgatacata 



cagccgcctg 

agtcccgcac 

ccaagtgcat 

ctgtccttgc 

aagaaactaa 

gcgccctcat 

gcatgctggg 

ccatctgggg 

acacctacaa 

actatgcgtt 

gccccaagaa 

aggtcttcga 

tgatatttgg 

tggtctagag 

tttttgtttg 

tagtattcat 

cttcattcaa 

ttttcagttg 

tatttgctag 

tacaaatgtc 

ataaaaaatt 



catctgtatc 

ccgttcggcc 

caaatacctg 

cattggacta 

taataataat 

gatgctggtg 

actgttcttc 

atattcccac 

caagctgaaa 

gaactgctgt 

ggacgtactc 

caataaattc 

catgatcttc 

tcagcttaca 

tttgttttgt 

tctgcattgc 

tattgacatt 

tttgtttttg 

actctagaca 

tatcaacttt 

atgacaatgt 



cagcgccaigg 

caggctaagt 

ctgttcggat 

tggctccgat 

tccagcttct 

ggcttcctgg 

ggcttcctct 

aaggatgagg 

accaaggatg 

ggtttggctg 

gaaaccttca 

cacatcatcg 

agtatgatct 

tccctgagca 

tttgtttgtt 

tagataaaag 

tgtagttgag 

cttgttatat 

agatattgta 

aatcaagttg 

caaaaaaaaa 



tcccgccagt 

tagccctcac 

ttaacttcat 

tcgactctca 

acacaggagt 

getgctgcgg 

tggtgatatt 

tgattaagga 

agccccagcg 

ggggcgtgga 

ccgtgaagtc 

gcgcagtggg 

tgtgctgtgc 

ggaaagttta 

gtttgttgtt 

ctgaagttac 

cggggggttt 

taagcagaaa 

cataaaagaa 

taacttatat 

aaaaaa 



cccagctgcg 

catgccggtc 

cttctggctt 

gaccaagagc 

ctatattctg 

ggctgtgcag 

cgccattgaa 

agtccaggag 

ggaaacgctg 

acagtttatc 

ctgtcctgat 

catcggcatt 

tatccgcagg 

cccatgaaga 

tgtttttttg 

tttatgtttg 

ggttrgcttt 

tcctgcaatg 

tttttttgtc 

tgaagacaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1246 



<210> 359 
<211> 2360 
<212> DNA 

<213> Homo sapiens 



<400> 359 

gctacgcggg 

gggcgggctg 

ccaggcttgc 

cgcccagctc 

gcctcctctc 

gctgcagttc 

gctgctggaa 

ggaggcggcc 

gtcactttat 

agaaagaaga 

gacttctgca 

tgccacccgg 

ttatatacct 

tgtctgctgg 

gtggaaaatg 

ctcaaggaat 

gaggggaaag 

tgggcctccc 

tccgatggta 

ccctccacca 

aatgccggtt 



ccacgctgct 

agtgagcaag 

acgcagaggc 

tggtgccagc 

ggtgcgggac 

tctcggcact 

ccagcaaccc 

gggaccggaa 

cctccaagcc 

aaggcaaggg 

tccatggaga 

gttaccatgg 

atgaccacac 

tcatcgtggg 

aagagaaagt 

cggctgggga 

acttccatct 

ataattgctt 

tctgggtaag 

aaccccactt 

aaattccata 



ggctggcctg 

acaagacact 

gggcggcaga 

gcccagtggc 

catgaagctg 

ggtgactggc 

ggaccctccc 

agtccgtgac 

acaagcactg 

gctagggaag 

atgcaaatat 

agagaggtgt 

aaccatcctg 

gcttctcatg 

gaagttgggc 

ctgctacctc 

agtcacaaag 

tgccaaaata 

aagaaagcaa 

cccctcataa 

tgctccagga 



acctaggcgc gcggggtcgg 
caagaagagc gagctgcgcc 
cggtgcccgg cggaatctcc 
cgccgcttcg aaagtgactg 
ctgccgtcgg tggtgctgaa 
gagagcctgg agcggcttcg 
actgtatcca cggaccagct 
ttgcaagagg cagatctgga 
gccacaccaa acaaggagga 
aagagggacc catgtcttcg 
gtgaaggagc tccgggctcc 
catgggctga gcctcccagt 
gccgtggtgg ctgtggtgct 
tttaggtacc ataggagagg 
atgactaatt cccactgaga 
tgagaagaca caaggtgatt 
actccttcgt ccccagttgc 
ccagagcctt caagtgccaa 
aagcaaggga ccttcatgcc 
gtttgtttaa acacttatct 
tctttgactg aaaaaaaaaa 
gcggccgcgc
tgggtcccgg 
tgagctccgc 
gtgcctcgcc 
gctctttctg 
gagagggcta 
gctaccecta 
ccttttgaga 
gcacgggaaa 
gaaatacaag 
ctcctgcatc 
ggaaaatcgc 
gtcatctgtc 
aggttatgat 
gagacttgtg 
tcagactgca 
cgtctaggat 
acagagtatg 
cttctgattc 
tctggattag 
agaagaagaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 






gaaggagagc 
atgtactcaa 
gttaactgtg 
aaataacaac 
tattetatgt 
tagactgtta 
tgccacccac 
gtctgtctac 
gtcaggatgt 
taggcgattt 
cagatatcag 
atgcctgtag 
ctttcccttg 
aggggtcagt 
ggaacaggga 
agaaggttgg 
gtatgctgtc 
tatttccaaa 
ataaaaaaaa 



aagaaggaaa 
gtaccaccaa 
aaataccaca 
tatttttgtt 
atgttaattt 
ctttggcaat 
ctttgccaca 
atttctgcag 
aggggttaac 
tgtctaccat 
tgtqtccccg 
ctttcctggt 
tgtcaagaca 
gcagcagagg 
acattggagc 
aggtggggaa 
atggtccttt 
cctggttcat 
aaaaaaaaaa 



gatttgtgaa ctggaagaaa gcaacaaaga ttgaigaagcc 
gggatctgcc attgggaccc tccagtgctg gatttgatga 
agcctgagaa ctgaattttg ggacttctac ccagatggaa 
gttgttgttt gtaaatgcct cttaaattat atatttattt 
atttagtttt taacaatcta acaataatat ttcaagtgcc 
ttcctggccc tccactcctc atccccacaa tctggcttag 
aagctaggat ggttctgtga cccatctgta gtaatttatt 
atcttccgtg gtcagagtgc cactgcggga gctctgtatg 
ttggtcagag ccactctatg agttggactt cagtettgcc 
ttgtgttttg aaagcccaag gtgctgatgt caaagtgtaa 
tgtcctctcc ctgccaagtc tcagaagagg ttgggcttcc 
ccctcacccc catggcccca ggccacagcg tgggaactca 
tttctctaac tcctgccatt cttctggtgc tactccatgc 
acagtctgga gaaggtatta gcaaagcaaa aggctgagaa 
tgactgttct tggtaactga ttacctgcca attgctaccg 
ggctttgtat aatcccaccc acctcaccaa aacgatgaag 
ctggaagttt ctggtgccat ttctgaactg ttacaacttg 
atttatactt tgcaatccaa ataaagataa eccttattcc 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2360 



<210> 360 
<211> 1433 
<212> DNA 

<213> Homo sapiens 



<400> 360 

attcggggcg 

cctgcccgcc 

ctgccgagag 

ccgctgctgc 

ctgttccgct 

gcgccgcccg 

gtccgggagc 

ggcgtctaca 

ctgcccctgc 

tatggcgcca 

gtggagaacc 

aagcccctca 

caccggcaga 

cgaccacccc 

tccaccatgc 

cccaactgtg 

cagcgtgggg 

accatccggg 

gtgcacaccc 

gcccctctcc 

ttccagttct 

cccggcctct 

gaggaagggg 

tttatttttg 



agggaggagg 

cgcccgctcg 

tgggctgccc 

tgctgctact 

gcccgccctg 

ccgcggtggc 

cgggctgcgg 

ccccgcgctg 

aggcgctggt 

gcccggagca 

acgtggacag 

agtcgggtat 

tgggcaaggg 

ctgccaggac 

gccttccgga 

acaagcatgg 

agtgctggtg 

gggaccccga 

agcggatgca 

aaacaccggc 

gacacacgta 

ctcttcccag 

gttgtggtcg 

aacccctgtg 



aagaagcgga 

ctcgctcgcc 

cgcgctgccg 

gggcgcgagt 

oacacccgag 

cgcagtggcc 

ctgctgctcg 

cggccagggg 

catgggcgag 

ggttgcagac 

caccatgaac 

gaaggagctg 

tggcaagcat 

tccctgccaa 

tgagcggggc 

cctgtacaac 

tgtgaacccc 

gtgtcatctc 

gtagaccgca 

agaaaacgga 

tttatatttg 

ctgcagatgc 

gggagctggg 

tcccttttgc 



ggaggcggct 

cgccgcgccg 

ctgccgccgc 

ggcggcggcg 

cgcctggccg 

ggaggcgccc 

gtgtgcgccc 

ctgcgctgct 

ggcacttgtg 

aatggcgatg 

atgttgggcg 

gccgtgttcc 

caccttggcc 

caggaactgg 

cctctggagc 

ctcaaacagt 

aacaccggga 

ttctacaatg 

gccagccggt 

gagtgcttgg 

gaaagagacc 

cacacctgct 

gtacaggttt 

ataagattaa 



cccgctcgca 
cgctgccgac 
cgccgctgct 
gcggggcgcg 
cctgcgggcc 
gcatgccatg 
ggctggaggg 
atccccaccc 
agaagcgccg 
accactcaga 
ggggaggcag 

gggagaaggt 
tggaggagcc 
accaggtcct 
acctctactc 
gcaagatgtc 
agctgatcca 
agcagcagga 
gcctggcgcc 
gtggtgggtg 
agcaccgagc 
ccttcttgct 
ggggaggggg 
aggaaggaaa 



gggccgtgca 

cgccagcatg 

gecgctgctg 

cgcggaggtg 

cccgccggtt 

cgcggagctc 

cgaggcgtgc 

gggctccgag 

ggacgccgag 

aggiggcctg 

tgctggccgg 

cactgagcag 

caagaagctg 

ggagcggatc 

cctgcacatc 

tctgaacggg 

gggagceccc 

ggcttgcggg 

cctgcccccc 

ctggaggatt 

tcggcacctc 

ttccccgggg 

aagagaaatt 

agt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1433 



<210> 361 
<211> 1632 
<212> DNA 

<213> Homo sapiens 



<400> 361 

gccggccgaa 

gcgacctccg 

aggaaagcgc 

gtcgagtagg 

tgaggtgggg 

cgggcaggcc 

tccttgctgc 

aggcacagtg 

gatggaggag 

acagggaagg 

aagaattcag 



cccagacccg aggttttaga agcagagtca ggcgaagctg 
caaccttgag cggcatccgt ggagtgcgcc tgcgcagcta 
cgccggccag gcccagctgt ggccggacag ggactggaag 
tgtgcaccag ccctggcaac gagagcgtct accccgaact 
aagccgggga gggcagttga ggaccccgcg gaggcgcgtg 
agcctccgag ccgggtggac acaggtttta aaacatgaat 
cttttgcctg ggaattgcct cagctactct aacatttgat 
gaccaagtgg aaggcgatgc acaacagatt atacggcatg 
agcagtgtgg gagaagaaca tgaagatgat tgaactgcac 
gaaacacagc ttcacaatgg ccatgaacgc ctttggagac 
gcaggtgatg aatggctttc aaaaccgtaa gcccaggaag 
ggccagaacc 60
cgaccgcagc 120 
agaggacgcg 180 
ctgctggcct 240 
actggttgag 300 
cctacactca 360 
cacagtttag 420 
aatgaagaag 480 
aatcaggaat 540 
atgaccagtg 600 
gggaaagtgt 660 






tccaggaacc 

tgactcctgt 

ctcttgaagg 

tggtagactg 

ctttccagta 

caacagaaga 

tggacatccc 

ctgttgctat 

agccagactg 

aaagcacaga 

ggggcatggg 

cagcagccag 

ggatggcgca 

cacactcgaa 

aaatgttacc 

gactttgcat 

atttcaaatg 



tctgttttat 

gaagaatcag 

acagatgttc 

ctctgggcct 

tgttcaggat 

atcctgtaag 

taagcaggag 

tgatgcaggt 

tagcagtgaa 

atcagataac 

tggctacgta 

ctaccccact 

tgcatgggag 

tcattgaaga 

tctattttaa 

tttcgttttt 

ta 



qaggccccca gatctgtgga ttggagagag 
gqtcagtgtg gttcttgttg ggcttttagt 
cggaaaactg ggaggcttat ctcactgagt 
caaggcaatg aaggctgcaa tggtggccta 
aatggaggcc tggactctga ggaatcctat 
tacaatccca agtattctgt tgctaatgac 
aaggccctga tgaaggcagt tgcaactgtg 
catgagtcct tcctgttcta taaagaaggc 
gacatggatc atggtgtgct ggtggttggc 
iataaatatt ggctggtgaa gaacagctgg 
aagatggcca aagaccggag aaaccattgt 
gtgtgagctg gtggacggtg atgaggaagg 
qaattcatct tcagtctacc agcccccgct 
tccgagtgtg atttgaattc tgtgatattt 
ttactgctat aaataggttt atattattga 
aaaaggatgt ataaattttt acctgtttaa 



aaaggctacg 

gctactggtg 

gagcagaatc 

atggattatg 

ccatatgagg 

accggctttg 

gggcecattt 

atttattttg 

tacggatttg 

ggtgaagaat 

ggaattgcct 

acttgactgg 

gtgtcggata 

tcacactggt 

ttcacttact 

ataaaattta 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1632 



<210> 362 

<211> 2756 

<212> DNA 

<213> Homo sapiens 

<400> 362 
atgctgtcct 
ggtcataaaa 
gcctttgtgg 
ggtttataca 
aagaaaggaa 
tatgtacagg 
gacgatggca 
tatggtctga 
gccaaagagc 
gatgggaagg 
gagacatcta 
gaagatattt 
tctgatgatg 
ctctggtact 
aaactgattg 
acattcaaga 
cctgaagagt 
atagcttgtg 
ctgcagctcc 
agcattgtag 
tttttatctc 
gttttccgag 
ttctacccta 
aaattggatg 
acacccaact 
gtggccaaca 
gccaacaaac 
ggttacacat 
gctgcttgtg 
atggacatgc 
tgctcggaca 
aagttaccag 
gatgaccgcg 
ggccgcagca 
ggggcgggga 
gcgcggtgcc 
gacagaaaac
ccccacggga 
ttttatcttt 
tctaggcaca 
tggcaacaaa 
tatctatatc 
acttctttgc 
tcatattgag 



tccagtaccc 
tttgtgaccc 
aggcccagaa 
aagagagaat 
aacggtattt 
attccttaga 
cagtggcact 
gtgccagtgg 
ttccagatgt 
gaatgttcta 
ccaatctcca 
tgtgtgctga 
gccgctatgt 
gtgacctaca 
acaactttga 
cgaatcgcca 
ctaagtggaa 
tcaggtccaa 
atgacctgac 
ggtacagcgg 
caggtatcat 
aggtgaccgt 
gcaaggatgg 
gctctcatcc 
acagtgtttc 
tcagaggagg 
aaaactgctt 
ctcccaagag 
caaatcagag 
tgaagtttca 
gcaaacaaca 
aagcagatga 
tggtcccgct 
ggaagcaaag 
agcccacagc 
tgaacgtcga 
ctcaagggct 
acattattcc 
tttaggcttc 
tgtggttttt 
cacattgtga 
cataacctct 
aacaataata 
gggcaactcc 



cgacgtgtac 
ttacgcctgg 
taagattact 
gactgaacta 
ttatttttac 
gggtgaggcc 
ccgaggttat 
ctcagactgg 
gcttgaaaga 
caactcatac 
ccaaaagctc 
gtttcctgat 
cttgttatca 
gcaggaatcc 
aggggaatat 
gtctcccaac 
agtacttgtt 
cttcttggtc 
tactggtgct 
tcagaagaag 
ttatcactgt 
aaaaggaatt 
tacgaagatt 
agctttctta 
caggcttatt 
tggcgaatat 
tgatgacttt 
gctgactatt 
acctgacctc 
taaatatacc 
ctttgaatgg 
catccagtac 
tcactccctg 
caaccccctg 
caaagtgata 
ctggattcca 
ttcccacgtt 
tggactgaca 
tcctttttag 
tggtgttttt 
atattagatt 
atatctttaa 
aatgttattt 
agaagtggag 



cgcgacgaga ccgccgtaca 
cttgaagacc ccgacagtga 
gtgccatttc ttgagcagtg 
tatgattatc ccaagtatag 
aatacaggtt tgcagaacca 
agagtgttcc tggaccccaa 
gcgttcagcg aagatggtga 
gtgacaatca agttcatgaa 
gtcaagttca gctgtatggc 
cctcaacagg atggaaaaag 
tactaccatg tcttgggaac 
gaacctaaat ggatgggtgg 
ataagggaag gatgtgatcc 
agtggcatcg cgggaatcct 
gactacgtga ccaatgaggg 
tatcgcgtga tcaacattga 
cctgagcatg agaaagatgt 
ttatgctacc tccatgacgt 
ctccttaaga ccttcccgct 
gacactgaaa tcttctatca 
gatcttacca aagaggagct 
gatgcttctg attaccagac 
ccaatgttca ttgtgcataa 
tatggctatg gcggcttcaa 
tttgtgagac acatgggtgg 
ggagagacgt ggcataaagg 
cagtgtgctg ctgagnatct 
aatggaggtt caaatggagg 
tttggttgtg ttattgccca 
atcggccatg cttggaccac 
cttgtcaaat actctccatt 
ccgtccatgc tgctcctcac 
aagttcattg ccacccttca 
cttatccacg tggacaccaa 
gaggaagtct cagacatgtt 
taaacagttt tcgtgcttcc 
gacaccaaga aaccactggg 
ggctacagtt gaacagaact 
caaggccttg gtgtttcttt 
tttaagggca tgttgggata 
gctgaattaa ggatcatagt 
ataaatgtga gaactgttct 
aagaatgaca gggatttact 
ttttctgtga gaataaagca 
ggattatcat
acagactaag 
tcccatcaga 
ttgccacttc 
gcgagtatta 
catactgtct 
atattttgcc 
agttgatggt 
ctggacccat 
tgatggcaca 
cgatcagtca 
agctgagtta 
agtaaaccga 
gaagtgggta 
ggcggtgttc 
cttcagggat 
cttagaatgg 
caagaacatt 
cgatgtcggc 
gtttacttcc 
ggagccaaga 
agtccagatt 
aaaaageata 
catatccatc 
tatcctggca 
tggtatcttg 
gatcaaggaa 
cctcttagtg 
agttggagta 
tgattatggg 
gcataatgtg 
tgctgaccat 
gtacatcgtg 
ggcgggccac 
tgcgttcatc 
tcctgacagc 
cataatgctt 
gccgtgggaa 
ttccaccctg 
aatagctaaa 
cgggcatact 
catggagaag 
tccggtttct 
tttcaccttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 




ctgcaacaag ttagttttca agcagttaag tcatagaatg tttgttagct gtgaaaataa 2700
gttgttcatc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag gaattc 2756

<210> 363 
<211> 2768 
<212> DNA 

<213> Homo sapiens 



<400> 363 
cactgctgtg 
tgaaaggaag 
acttggccat 
gaacccattc 
ttttgggaaa 
aaaagtatgg 
atcctgacat 
ggccttttgg 
ggaagagatt 
tccctatcat 
caggcaagcc 
gcacatcatt 
acaccaagaa 
ttccattcct 
caaatttttt 
agcaccgagt 
cccacaaagc 
gctatgaaac 
atgtccagca 
cctatgatac 
tattcccaat 
tgttcattcc 
agtactggac 
acatagatcc 
ggtttgctct 
aaccttgtaa 
aaaaacccgt 
tcctaaggac 
aattactttg 
taaccgggga 
gggaatataa 
cacgggacta 
agaatcaaca 
gatggctcta 
tgttaaatca 
gagaaatcta 
tgttggcgtg 
ccactgtgac 
atcaggcaca 
cgcctgaggt 
aaaaatacac 
tgaggcagga 
actgcactcc 
cacgtacatt 
ttttcagccc 
tgcattccat 
gattggtc 



cagggcagga 
actcagagga 
ggaaacctgg 
acatggactt 
tattttgtcc 
aaaagtgtgg 
gatcaaaaca 
tccagtggga 
acgatcattg 
tgcccagtat 
tgtcaccttg 
tggagtgaac 
gcttttaaga 
catcccaatt 
aagaaaatct 
ggatttcctt 
tctgtccgat 
cacgagcagt 
gaaactgcag 
tgtgctacag 
tgctatgaga 
caaaggggtg 
agagcctgag 
ttacatatac 
catgaacatg 
agaaacacag 
tgttctaaag 
ttctgctttg 
tgaatagaac 
ttctgtacat 
aggaggtgac 
tttccaccac 
tttctcaata 
acaatgacat 
ataaagacca 
tagaactgaa 
gggcctttgt 
tttgcccatt 
gtggctcacg 
caggagttca 
aaattagcta 
gaatcacttg 
agtctgggtg 
ttgtaacaaa 
atctcctttc 
cctcaacacc 



aagctccatg 
gagagataag 
cttctcctgg 
tttaagaagc 
taccataagg 
ggcttttatg 
gtgctagtga 
tttatgaaaa 
ctgtctccaa 
ggagatgtgt 
aaagacgtct 
atcgactctc 
tttgattttt 
cttgaagtat 
gtaaaaagga 
cagctgatga 
ctggagctcg 
gttctctcct 
gaggaaattg 
atggagtatc 
cttgagaggg 
gtggtgatga 
aagttcctcc 
acaccctttg 
aaacttgctc 
atccccctga 
gttgagtcaa 
ctcttcaaga 
tctgaaatga 
gcattgagct 
caaatcagtg 
ccccagttag 
atttcctcca 
ttatatcaca 
ctttacaaaa 
tgagaaccaa 
cagaactaga 
gtttagaaag 
cctgtaatcc 
agacaagcct 
ggcatggtgg 
aacctgggag 
agagtgagac 
gaactgaagc 
atatttctgg 
caactgtctc 



cacatagccc 
taaggaaagt 
ctgtcagcct 
ttggaattcc 
gcttttgtat 
atggtcaaca 
aagaatgtta 
gtgccatctc 
ccttcaccag 
tggtgagaaa 
ttggggccta 
tcaacaatcc 
tggatccatt 
taaatatctg 
tgaaagaaag 
ttgactctca 
tggcccaatc 
tcattatgta 
atgcagtttt 
ttgacatggt 
tctgcaaaaa 
ttccaagcta 
ctgaaagatt 
gaagtggacc 
taatcagagt 
aattaagctt 
gggatggcac 
aatctgtgcc 
agatgggctt 
ctctcattgt 
tgaggaggta 
caccattaac 
caaattatta 
tgttttctct 
gtattatcag 
caagtaaata 
atttgattat 
aatattcata 
tagcagtttg 
ggcctacatg 
actcgcctgt 
gcggatgttg 
tcagtcttaa 
tcttattata 
gagacagaaa 
gatgcaatga 



agcaaagagc 
agtgatggct 
ggtgctcctc 
agggcccaca 
gtttgacatg 
gcctgtgctg 
ttctgtcttc 
tatagctgag 
tggaaaactc 
tctgaggcgg 
cagcatggat 
acaagacccc 
ctttctctca 
tgtgtttcca 
tcgcctcgaa 
gaattcaaaa 
aattatcttt 
tgaactggcc 
aeccaataag 
ggtgaatgaa 
agatgttgag 
tgctcttcac 
cagcaagaag 
cagaaactgc 
ccttcagaac 
aggaggactt 
cgtaagtgga 
tgagaacacc 
catccaatgg 
ctgtgtagag 
gatttggctc 
tcctcctgag 
atgaaaataa 
ggagtattct 
atgctttcct 
tttttggtca 
taacataggt 
gtttaattat 
ggaggctgag 
gttgaaaccc 
aatctcacta 
aagtgagctg 
aaaaatatgc 
ttattagttt 
acatgtttcc 
acacttaata 



aacacagagc 
ctcatcccag 
tatctatatg 
cctctgcctt 
gaatgtcata 
gctatcacag 
acaaaccgga 
gatgaagaat 
aaggagatgg 
gaagcagaga 
gtgatcacta 
tttgtggaaa 
ataacagtct 
agagaagtta 
gatacacaaa 
gaaactgagt 
atttttgctg 
actcaccctg 
gcaccaccca 
acgctcagat 
atcaatggga 
cgtgacccaa 
aacaaggaca 
attggcatga 
ttctccttca 
cttcaaccag 
gcctgaattt 
agagacctca 
actgcataaa 
tgttatactt 
ctctgcttct 
ctctgataag 
gaattatttt 
ataagtttta 
gcacattaag 
ttgtaatcac 
gaaagttaat 
gccttttttg 
ccgggtggat 
catctctact 
cacaggaggc 
agattgcacc 
ctttttgaag 
tgatttaatg 
ctacacctct 
aaaaacagtc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2768 



<210> 364 

<211> 2984 

<212> DNA 

<213> Homo sapiens 



<400> 364 

gaggaggaac 

agaaagcaag 

tctggagttt 

cctgggcggc 

ggacctgggt 



agaaaagaaa agaaaagaaa aagtgggaaa caaataatct 
aagagtgacc cccttgtggg cactccattg gttttatggc 
gtgtaaaaca aaaatattat ggtctttgtg cacatttaca 
acagccagat gcgagatgcg tctctgctga tctgagtctg 
cttccctgaa gcatctccag ggctggaggg acgactgcca 
aagaatgagg 60
gcctctactt 120 
tcaagctcag 180 
cctgcagcat 240 
tgcaccgagg 300 






gctcatccat 
cctgatctgt 
caagcccacc 
caggtgtcag 
accctggatt 
catcacctgg 
ctcagagagc 
ctcagcccag 
acaggtggca 
cctgaactcc 
gagcccgagt 
gtggtctcta 
actctcagtg 
ctctgatgct 
gctcgctggc 
gagccgctcc 
gtcggccccc 
cctctcggtg 
gtcacaggga 
gcgtctaaga 
gacctcagcc 
gctgactcac 
ctccccgaca 
cgggtcggat 
ggccgtcatc 
tcagggcaaa 
tgtggggcca 
ccaggaagaa 
ggacactcgg 
ctccagacct 
cacaaaggac 
agccccccag 
tgagcctcct 
ggccatccac 
gggagctgcc 
actctgggaa 
ttttgaaata 
gaaagtgcat 
actacaagcc 
ctttcctata 
gggaggccga 
tgaaaccctg 
tcccagctat 
agtgagccaa 
attaaaaaaa 



ccacagagca 
ctcgggctga 
ctctgggctg 
gggggccagg 
acacggatcc 
gaacatgcag 
agtgaccccc 
cccagccccg 
tttgatggct 
cagccccatg 
cgcaggtggt 
cccagtgatc 
cagccaggtc 
ggctacaaca 
gcacagcccc 
tacgggggcc 
agcgaccccc 
cagccgggcc 
tggatgcaaa 
tcaacgtacc 
catgcgggga 
cccagtgacc 
acaggcccca 
ccccagagtg 
ctactgctcc 
cactggacat 
gagcccacag 
aacctctatg 
agcccacacg 
aggagagaaa 
agacaggcgg 
gatgtgacct 
ccatcccagg 
tagcccaggg 
cccccagtgg 
cttttagggg 
aagcaacaga 
taaactgaat 
acgaatgaat 
agaaatttag 
ggcgggcaga 
tctctcctaa 
ttgggaggct 
gatcgcacca 
aaaaaaaaaa 



gggcagtggg aggagacgcc atgaccccca 
gtctgggccc ccggacccac gtgcaggcag 
aaccaggctc tgtgatcacc caggggagtc 
agacccagga gtaccgtcta tatagagaaa 
cacaggagct tgtgaagaag ggccagttcc 
ggcggtatcg ctgttactat ggtagcgaca 
tggagctggt ggtgacagga gcctacatca 
tggtgaactc aggagggaat gtaaccctcc 
tcattctgtg taaggaagga gaagatgaac 
cccgtgggtc gtcccgcgcc atcttctccg 
ggtacaggtg ctatgcttat gactcgaact 
tcctggagct cctggtccta ggtgtttcta 
ctatcgtggc ccctgaggag accctgactc 
gatttgttct gtataaggac ggggaacgtg 
aggctgggct ctcccaggcc aacttcaccc 
agtacagatg ctacggtgca cacaacctct 
tggacatcct gatcgcagga cagttctatg 
ccacggtggc ctcaggagag aacgtgaccc 
ctttccttct gaccaaggag ggggcagctg 
aatctcaaaa ataccaggct gaattcccca 
cctacaggtg ctacggctca cagagctcca 
ccctggagct cgtggtctca ggaccgtctg 
cctccacatc tggccctgag gaccagcccc 
gtctgggaag gcacctgggg gttgtgatcg 
tcctcctcct cctcctcttc ctcatcctcc 
cgacccagag aaaggctgat ttccaacatc 
acagaggcct gcagtggagg tccagcccag 
ctgccgtgaa gcacacacag cctgaggatg 
atgaagaccc ccaggcagtg acgtatgccg 
tggcctctcc tccttcccca ctgtctgggg 
aagaggacag gcagatggac actgaggctg 
acgcccagct gcacagcttg acccttagac 
aagggccctc tccagctgtg cccagcatct 
ggggacgcag accccacact ccatggagtc 
acaccattgg accccaccca gcctggatct 
tcactcaatt ctgcagtata aataactaat 
cttctcaata atcaatgaag tagctgagaa 
cacaatgtaa atantacaca tcaagcgatg 
gaattaggaa agaaaaaaag taggaaatga 
ggcagggcac ggtggctcac gcctgtaatt 
tcacgagttc aggagatcga gaccatcttg 
aaatacaaaa attagctgga tgtggtggca 
gaggcaggag aatcgcttga accagggagt 
ctgctctcca gcctggcgac agagggagac 
agaaagaaaa aaaaaaaaaa aaaa 



tcctcacggt 
ggcacctccc 
ctgtgaccct 
agaaaacagc 
ccatcccatc 
ctgcaggccg 
aacccaccct 
agtgtgactc 
acccacaatg 
tgggccccgt 
ctccctatga 
agaagccatc 
tgcagtgtgg 
acttccttca 
tgggccctgt 
cotccgagtg 
acagagtctc 
tgctgtgtca 
atgacccatg 
tgggtcctgt 
aaccctacct 
ggggccccag 
tcacccccac 
gcatcttggt 
gaeatcgacg 
ctgcaggggc 
ctgccgatgc 
gggtggagat 
aggtgaaaca 
aattcctgga 
ctgcatctga 
ggaaggcaac 
acgccactct 
tggaatgcat 
accccaggag 
gtctctacaa 
aactaagtca 
aaactggaaa 
atgatcttgg 
ccagcacttt 
gccaacatgg 
gtgcctgtaa 
cagaggtttc 
tccatctcaa 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2984 



<210> 365 
<211> 3061 
<212> DNA
<213> Homo sapiens 



<400> 365 

cggcacgagg 

tgctggacga 

aggaagacca 

ggctgattgt 

tgaacaatgc 

ccattgatgc 

ttggctccaa 

gtgtggaggg 

actactgtcc 

tggcctttcc 

cagaatatgg 

agaaggcccc 

tggataaagc 

gaaagaaggg 

agcagatgag 

agttcagtaa 



cgactttggt 
tgtggagctg 
gggaatttat 
caatgtgaat 
ctttgaggag 
tacctatgcc 
gcacgtctcc 
cattgtcact 
tgctactaag 
ctccagctct 
cctttctgtc 
agccggccag 
gaagcctggt 
aggctacacc 
caaggatgct 
aacccgatcc 



ggaggtagtt ctttggcagc gggcatggcg 
cgggaggctc agagagatta cctggacttc 
cagagcaaag ttcgggagct gatcagtgac 
gacctgcgca ggaaaaacga gaagagggct 
ctggttgcct tccagcgggc cttaaaggat 
aagcagtatg aggagttcta cgtaggactg 
ccgcggactc ttacctcctg cttcctcagc 
aaatgttctc tagttcgtcc caaagtcgtc 
aagaccatag agcgacgtta ttctgatctc 
gtctatccta ccaaggatga ggagaacaat 
tacaaggatc accagaccat caccatccag 
ctcccccgct ctgtggacgt cattctggat 
gaccgggttc aggtggtggg aacctaccgt 
tctgggacct tcaggactgt cctgattgcc 
cagccctctt tctctgctga ggatatagcc 
aaggatatct ttgaccagct ggccaagtca 
ggtaccgtgg 60
ctggacgacg 120 
aaccaatacc 180 
aaccggcttc 240 
tttgtggcct 300 
gaaggcagct 360 
tgtgtggtct 420 
cgcagtgtcc 480 
accaccctgg 540 
ccccttgaga 600 
gagatgccgg 660 
gatgacttgg 720 
tgccttcctg 780 
tgtaatgtta 840 
aagatcaaga 900 
ttggccccaa 960 






gtatccatgg 

aacgagacct 

acccatccgt 

tccccaccac 

aggaaacagg 

tttgcatfga 

tggagcaggg 

gcagtgtttt 

tggagaacat 

tggatcagat 

gttacagagc 

tcctggccac 

agaagcatga 

cattcatgaa 

cggccaccta 

acaccgccag 

cagcccatgc 

tggagttggt 

agcgaagtga 

agcagaagag 

acccctatga 

cagactcaca 

cattcaaggt 

atcgcctcac 

aggctgctct 

tcctcatctg 

tttcccaccc 

actgaaggcg 

atgggtcatg 

tattgcatct 

ccaattgtaa 

ctgttttttt 

gttccaattg 

cagtagcacc 

aatatccgag 



gcatgactat gtcaagaaag caatcctctg cttgctcttg 
agaaaatggc agccacatcc gtggggacat caatattctt 
tgccaagtct cagcttctgc ggtatgtgct ttgcactgca 
tggccggggc tcctctggag tgggtctgac ggctgctgtc 
aqagcgccgt ctggaagcag gggccatggt cctggctgac 
tgaatttgac aaaatgtctg acatggatcg cacagccatc
tcgagtgacc attgccaagg ctggcatcca tgctcggctg 
ggcagctgcc aaccctgtct acggcaggta tgaccagtat 
tgggctacag gactcactgc tgtcacgatt tgacttgctc 
ggatcctgag caggatcggg agatctcaga ccatgtcctt 
acctggggag caggatggcg atgctatgcc cttgggtagt 
agatgatccc aactttagcc aggaagatca gcaggacacc 
caaccttcta catgggacca agaagaaaaa ggagaagatg 
gaagtacatc catgtggcca aaatcatcaa gcctgtcctg 
cattgcagaa gagtattcac gcctgcgcag ccaggatagc 
gacatctcca gttacagccc gaacactgga aactctgatt 
gaaggcccgc atgagcaaga ctgtggacct gcaggatgca 
ccagtatgct tactttaaga aggttctgga gaaggagaag 
ggatgaatca gagacagaag atgaagagga gaaaagccaa 
gaagagaagg aagactcgcc agccagatgc caaagatggg 
cttcagtgac acagaggagg aaatgcctca agtacacact 
ggagaccaag gaatcccaga aagtggagtt gagtgaatcc 
ggccctcttg gatgtgttcc gggaagctca tgcgcagtca 
agaatccatc aaccgggaca gcgaagagcc cttctcttca 
gagcaagatg caggatgaca atcaggtcat ggtgtctgag 
aggaggcctc gtctetgaac ttgggttgtg ccgagagagt 
tctccctgac ccaagtcttt gcctctactc ccttaacagt 
aggaatgttg gtgatgaagc tgagttcagg actcggtgga 
aaagctgcca tggggtgagg aaagaggaga cagtgggaga 
tcattgcaaa agcactggct catccgccct acttcccatc 
ataacatatg acttctgagt acttttgggg gcacaactgt 
gttttgtttt ttttctccag agcactttgg tctagactag 
gtggagagaa gctctgaggc acgtcatgca ggtcaagaaa 
agttaaggtg aatatgtatt gtatcacaaa acaaacccaa 
atgttgaata aacttagcca tttcgtacaa aaaaaggggg 



ggaggggtgg 1020 
ctaataggag 1080 
ccccgagcta 1140 
accacagacc 1200 
cgaggcgtgg 1260 
catgaagtga 1320 
aatgcccgct 1380 
aagactccaa 1440 
ttcatcatgc 1500 
cggatgcace 1560 
gctgtggata 1620 
cagatttatg 1680 
gtgagtgcag 1740 
acacaggagt 1800 
atgagctcag 1860 
cgactggcca 1920 
gaggaagctg 1980 
aaacgtaaga 2040 
gaggaccagg 2100 
gattcatacg 2160 
ccaaagacgg 2220 
aggttgaagg 2280 
atcggcatga 2340 
gttgagatcc 2400 
ggcatcatct 2460 
ttgttctgtg 2520 
gttgaattca 2580 
ccctttggga 2640 
ggacaatgac 2700 
ccacacaaac 2760 
tttctgtttg 2820 
gctttgggtg 2880 
gctttctttg 2940 
tatccagatg 3000 
gcccggtaaa 3060 
3061 



<210> 366 
<211> 1360 
<212> DNA 

<213> Homo sapiens 



<400> 366 

cgggggttgc 

cggccacgtc 

cttttcgcgc 

cccgctttgt 

ccgcgtccga 

gcctggcctc 

tgaagatccg 

actacacgac 

ggattgtcct 

agacggaaca 

tggatgagct 

agctggccta 

gaggccaggt 

gtgacatgcg 

ggttcaccag 

agatgagcag 

tgcagtcaca 

gctttggagc 

cggatgtgcg 

agtcgcggct 

actacaacaa 

gtcctttgga 

tcttctcctg 

<210> 367 



tccgtccgtg 

gtccttcgga 

gcccagcatt 

gtcctcgtcc 

cgggctgctg 

ctacctggac 

cgactggtac 

catccaggac 

gcagatcgac 

ggctctgcgc 

gaccctggcc 

cctgaagaag 

cagtgtggag 

aagccaatat 

ccggactgaa 

gtccgaggtt 

gctgagcatg 

ccagctggcg 

agctgatagt 

ggagcaggag 

tttgtctgcc 

gggtgtcttc 

acctgccaat 



ctccgcctcg 

ggcctgggcg 

cacgggggct 

tcctcggggg 

gcgggcaacg 

aaggtgcgcg 

cagaagcagg 

ctgcgggaca 

aacgcccgtc 

atgagcgtgg 

aggaccgacc 

aaccatgagg 

gtggattccg 

gaggtcatgg 

gaattgaacc 

actgacctgc 

aaagctgcct 

catatccagg 

gagcggcaga 

attgccacct 

tccaaggtcc 

tgggtagagg 

aaaaatttat 



ccatgacttc 

gcggctccgt 

ccggcggccg 

gctacggcgg 

agaagctaac 

ccctggaggc 

ggcctgggcc 

agattcttgg 

tggctgcaga 

aggccgacat 

tggagatgca 

aggaaatcag 

ctccgggcac 

ccgagcagaa 

gggaggtcgc 

ggcgcaccct 

tggaagacac 

cgctgatcag 

atcaggagta 

accgcagcct 

tctgaggcag 

gatgggaagg 

ggtccaaggg 



ctacagctat 

gcgttttggg 

cggcgtatcc 

cggctacggc 

catgcagaac 

ggccaacggc 

ctcccgcgac 

tgccaccatt 

tgacttccga 

caacggcctg 

gatcgaaggc 

tacgctgagg 

cgatctcgcc 

ccggaaggat 

tggecacacg 

tcagggtctt 

actggcagaa 

cggtattgaa 

ccagcggctc 

gctcgaggga 

caggctctgg 

aagggaccct 



cgccagtcgt 

ccgggggtcg 

gtgtcctccg 

ggcgtcctga 

ctcaacgacc 

gagctagagg 

tacagccact 

gagaactcca 

accaagtttg 

cgcagggtgc 

ctgaaggaag 

ggccaagtgg 

aagatcctga 

gctgaagcct 

gagcagctcc 

gagattgagc 

acggaggcgc 

gcccagctgg 

atggacatca 

caggaagatc 

ggcttctgct 

tacccccggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1360 

<211> 1412 

<212> DNA 

<213> Homo sapiens 

aicactcgct ccaccttctc caccaactac «9tccctgg 9««» Sggctxtggt 180 
tacggcgccc ?gccggtcag "gcgcggcc agcgtctatg caggcg gg 999 gggcctg 2 „. 

ssssss -sssssss- ?i??l|b ssssss : sssss ss 

tttagagtca agtatgagac agagctggcc atgcgccagt "giggag gacagagatc 660 

sksk assja- ssgg g Sssas teas & 

cgagaggagc tagacaagta ctggtctcag "gattgagg Jg9 t a 2 g ?I C agtc 960 

S33K38 3SSSS p t fj|t H^^sg^ggggg "18 

s&ss ssasss? spii HI ?fea a i bs Boo 

gSStg SSS?SS SSSSS383 !A&f«. ggagcaggag 1380 

gccaataaai agttcagagt tcattggatg tc 



<210> 368 

<211> 1075 

<212> DNA 

<213> Homo sapiens 



Relate acatcegtag aaggcagege gecgegg agccgcagcg ccgctcgccc 60, 

g3SS3 KKSSS I II III r c fe sssss 88 

ccatgggcgc cgcggcccgc accctgcggc jggegctegg cctcctg g gC gttttgca 420 
tgcttcgccc ggccgacgcc tgeagctget ccccggtgca cccgca y y y » 4g0 
atgcagatgt agtgatcagg gecaaagegg J^gtgagaa 9g|agtggac yy y 54Q 
acatttatgg caaccctatc aagaggatcc ^gtatgagax caay y gt | tg tgggg 600 
aagggcctga gaaggatata gagtttatct acacggcccc ctccrcggca y y y yyyy 
tctc.gc.tgga cgttggagga aagaaggaat getcattge |g|jaaggcc ^99999^9 72Q 
geaagatgea catcaccctc ^tgacttca tcgtgcccxg gy « acgcgctgcc 780 

Igaagaagag cctgaaccac aggtaccaga ^gggctgega g9« | tg | g ? caca g 840 
ccatgatccc gtgetacate «ctccccgg ^gagtgeer cxgg »» g»jj acggct 900 
agaagaacat caacgggcac "ggccaagt tcttcgcctg «u y y atcgaggacc 960 

311 fill I1PJ ilii bsb aar^w 



<210> 369 

<211> 1127 

<212> DNA 

<213> Homo sapiens 



liil smss ssss llIHIIMl fliii! iilf Is 

gcgggcctgg gggcgctacg Jtgggagagg aagccagggg "cctgcctc tgc 9 24Q 
SS!?SS SSSSS ?J § IS 3 paa gac?ccag« ctgeagagag 300 
tcatggaggc 
ggaggacgac 
gccccccaac 
gtttgtggac 
gatgcggcaa 
caggggaagc 
cccactgccc 
aaaagaggat 
gtgtgaagcc 
cggaagtggc 
agtcgcgagc 
ggccgtgctc 



gctggggctg 
gaagggatgg 
ctctgggcag 
tcctttaaga 
agctccagct 
tccgccccct 
tgggcagcca 
ccgtgctgtc 
acggaaggct 
tccgtggccc 
caggtttaac 
actaccaaat 



tggagatccg gagtcgccac agctcctacc ccgcggggac 
gggaggagcc cagccccttt cggggccgct cgcgctcggc 
cacagcgcta tggccgcgag ctccggagga tgagtgacga 
agggacttcc tcgcccgaag agcgcgggca cagcaacgca 
ggacgcgagt cttccagtcc tggtgggatc ggaacttggg 
cccagtgacc ttcgctccac atcccgaaac tccacccgtt 
tcttgaatat gggcggaagt acttccctca ggcctatgca 
tcctttggag ggagggctga cccagattcc cttccggtgc 
tggtcccatc ggaagttttg ggttttccgc ccacagccgc 
cgccctcagg ctccgggctt tcccccaggc gcctgcgcta 
cgttgcgtca ccgggacccg agcccccgcg atgccctggg 
gttaataaag cccgcgtctg tgccgcc 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1127 



<210> 370 
<211> 1890 
<212> DNA 

<213> Homo sapiens 



<400> 370 

cttaataaga 

ttatgcttct 

ccactcctct 

ctttcaccaa 

ccatagtggt 

agttttctgg 

tttccagcaa 

attttgggat 

tggaggagtt 

ctccctgcaa 

attttctcac 

tccaggtctg 

tgcttaaaaa 

catcactgga 

aggaaaagga 

atctatttgt 

tgctgaagca 

gacacaggag 

acgagatcca 

atactaagtt 

cataaaacta 

aatttcctaa 

agaaaagtga 

ttgcccgcat 

ctgttgatga 

caccctcata 

atctgctatc 

gacctctcat 

gagagttgtt 

tattaattgc 

taaaacacag 

aaataaaaag 



agagaaggct 

cttttcactc 

tcctattatt 

tttctcaaaa 

gtttcatgga 

aagaggcaat 

tggaaagaga 

ggggaagagg 

gagaaaaacc 

tgtgatctgc 

cctgatgaaa 

caataatttc 

tgttgctctt 

tgttaacaat 

caaccaaaag 

tgctggaaca 

cccagaggtc 

cccctgcatg 

gagatacagt 

cagaaactac 

gggcacaacc 

tccaaatatc 

ctacttcatg 

ggagctattt 

tttaaagaac 

ccagatctgc 

acctgcaact 

caaatcttcc 

caggtcactg 

tgcatatgct 

aaaagtgatt 

tgttattaat 



tcaatggaac 

tggagacaga 

ggaaatatgc 

gtctatggtc 

tatgaggcag 

tccccaatat 

tggaaggaga 

agcattgagg 

aaggcttcac 

tccgttgttt 

agattcaatg 

cctctactca 

acacgaagtt 

cctcgggact 

tcagaattca 

gagacaacaa 

acagctaaag 

caggatagga 

gaccttgtcc 

ctcatcccca 

ataatggcat 

tttgaccctg 

cctttctcag 

ttatttctaa 

ctcaatacta 

ttcatccctg 

ctttttttat 

cattcactca 

cacaaatata 

aatacttttc 

aatgaatgat 

tgctggttca 



cttttgtggt 

gctgtaggag 

tacagataga 

ctgtgttcac 

tgaaggaagc 

ctcaaagaat 

tccggcgttt 

accgtgttca 

cctgtgatcc 

tccagaaacg 

aaaacttcag 

ttgattgttt 

acattaggga 

ttatggattg 

atattgaaaa 

gcaccactct 

tccaggaaga 

gccacatgcc 

ccaccggtgt 

agagctttga 

tactgacttc 

gccactttct 

caggaaaacg 

ccacaatttt 

ctgcagttac 

tctgaagaat 

caaggacatt 

atatcccata 

tctgcaatta 

taatgctgac 

aatttagtcc 



cctggtgctg 

aaggaagctc 

tgttaaggac 

cgtgtatttt 

cctgattgat 

tactaaagga 

ctccctcaca 

agaggaagct 

cactttcatc 

atttgattat 

gattctgaac 

cccaggaact 

gaaagtaaaa 

cttcctgatc 

cttggttggc 

gagatatgga 

gattgatcat 

ttacactgat 

gccccatgca 

taacaagata 

cgtgctacat 

agataagaat 

aatttgtgca 

acagaacttt 

caaagggatt 

gctagcccat 

cccactatta 

agcatccaaa 

ttcatactct 

tttttaatat 

atttcttttg 



tgtctctctt 

cctcctggcc 

atctgcaaat 

ggcatgaatc 

aatggagagg 

cttggaatca 

aacttgcgga 

cactgccttg 

ctgggctgtg 

aaagatcaga 

tccccatgga 

cacaacaaag 

gaacaccaag 

aaaatggagc 

actgtagctg 

ctcctgctcc 

gtaattggca 

gctgtagtgc 

gtgaccactg 

atgctggctg 

gatgacaaag 

ggcaacttta 

ggagaaggac 

aacctgaaat 

gtttctctgc 

ctggctgctg 

tgtcttctct 

ctccattaag 

gtaacacttg 

gttatcactg 

tgaatgtgct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1890 



<210> 371 
<211> 4946 
<212> DNA 

<213> Homo sapiens 



<400> 371 

agtcagccct 

actgcgggac 

cagggctctc 

cggtcggctc 

ccgggcaatg 

gaatgacatc 

tgaggttcac 

ggagccgaag 

caacggaccg 

cccgcgaaaa 



gctgccagcc agtgccgggt gctggggact cagggaggcc 
agtgagccga gcagaagctg gaacgcagga gaggaaggag 
aggagccggg tcctgggcaa ggcgcagccg ttttcaaatt 
acactcgagc agtaaaaaga tgcctctggg gaggaggccc 
gtggtggctc ggcctagaga ggcggtagtg gaacgcagac 
aagggaggag acgggcggga ccccagattt ctgcctgtgg 
tggccagcgg agccggacao agaacgcgca aaacgccgtg 
agcaggcgga ccccctccgc gggggaacag tttccgccgg 
gaagtggggg gcggaagtgc agtgggctca gcgccgactg 
ctctgagctg gctgacagct ggggacgggt ggcggccctc 
cgccgggacc 60 
agggggcggt 120 
ttcaggaaag 180 
gtgcagctct 240 
cctggtgggg 300 
gcgatggaag 360 
taggcctgga 420 
gagcacaaag 480 
cgcgcctctg 540 
gactggagtc 600 






ggttgagttc 
tgtgccatag 
ttaccaggag 
agcctgcccc 
tcctgaatgg 
tgaccaaggt 
gtacagggaa 
acggcatctc 
ccttgtcatc 
acatcccaaa 
agcttcggaa 
gcatccatcc 
agaaggtggc 
aggagctggc 
gggtgtgccc 
cgtacaatta 
cagtcgtgat 
ttgacctgac 
agcagaccaa 
gcccagggct 
tggagggggc 
ggagctacat 
tcctggactc 
acacggccgg 
ccgagggcag 
tccatcctga 
cagccagaaa 
acgagctggt 
tgtcctcctt 
tcatcgacaa 
tgagctccgc 
tgggcaacat 
tcatggagaa 
tgaagccgct 
actatgcaag 
gcaaggccag 
gcctcccgta 
agatgaaggg 
agcaggcgtc 
acggagctgt 
cctcctgggt 
tggcccagtt 
cacccagtgt 
ccaccaggaa 
ggtcaccagc 
ttcctgcccg 
cggggagccc 
agaggccggc 
aggagcccgt 
aggagttgag 
ccgatgactt 
agcacaacct 
aggaggtctg 
cccgaaggca 
agctgaccgt 
gcaggcccca 
gggggtctgg 
ctgatgcccg 
cctataagca 
caaagccaga 
agcagcgctt 
caccgggacc 
aaccaggccc 
ttagacagag 
cctcaggacc 
tgccacaccg 
gtgcccacgg 
gtgggccaag 



ctgagggacc 
ggattctcga 
tgcccgagac 
tcagccacgc 
tgtgaccgta 
cctggaatgt 
gacgctgtgc 
tgcccgcaag 
ctggggcaac 
gattatttac 
cacctcctac 
tgaggtgaag 
aagtcgctcc 
cagccccatc 
ttactacctg 
cttgttggat 
ctttgacgaa 
tccccatgac 
ggcagcgcag 
gaacatggag 
catcgatgct 
ctttgagctg 
gctggaccag 
actgcagaag 
ccctggttxc 
tgctggtcac 
gcgagggaag 
ccgccagggc 
tgctctggag 
gcaccagatc 
gtttgacaga 
cgcccgcgtg 
gagcctggag 
gtttgtggag 
ggttgccgcc 
cgaggggctg 
ccccccacgc 
ccagggtggg 
cagggctgtg 
cttcctctgt 
gcgtccccac 
cttccgtgtt 
gcgtggagaa 
agctaagagt 
tgccggggac 
gcagaggccc 
tggcgaggag 
agaagaaccg 
ggctggtgca 
ccaagccaac 
cgccgccctg 
gctccaaggc 
tatccagctg 
gcgggcacag 
gtccacggct 
cctgtcgccc 
agtgcccaga 
cagggccctg 
agacgacgac 
ggacttcccc 
ctcacagacg 
ccaggaggag 
ctcacggtcc 
gccagcaggg 
tccccacggg 
cctccaggaa 
aggcccccag 
aacccaccca 



ccggttctgg aaggttcgcc gcggagacaa 
agagaacagc gttgtgtccc agtgcacatg 
cctaagatgt tcggagtggt tttttcgcac 
tctgtgccct tctgagaaca ggctgatatg 
gacttccctt tccagcccta caaatgccaa 
ctgcagcaga aggtgaatgg catcctggag 
ctgctgtgca ccacgctggc ctggcgagaa 
attgccgaga gggcgcaagg agagcttttc 
qctgctgctg ctgctggaga ccccatagct 
qcctccagga cccactcgca actcacacag 
cggcctaagg tgtgtgtgct gggctcccgg 
aaacaagaga gtaaccatct acagatccac 
tqtcatttct acaacaacgt agaagaaaaa 
ctggacattg aggacttggt caagagcgga 
tcccggaacc tgaagcagca agccgacatc 
qccaagagcc gcagagcaca caacattgac 
qctcacaacg tggagaagat gtgtgaagaa 
ctggcttcag gactggacgt catagaccag 
caqggtgagc cccacccgga gttcagcgcg 
ctggaagaca ttgcaaagct gaagatgatc 
qttgagctgc ctggagacga cagcggtgtc 
tttgctgaag cccagatcac gtttcagacc 
atcatccagc acctggcagg acgtgctgga 
ctggcggaca ttatccagat tgtgttcagt 
ccagcagggc tgggggcctt aca?tcctat 
cggaggacgg ctcagcggtc tgatgcctgg 
gtgctgagct actggtgctt cagtcccggc 
qtccgctccc tcatccttac cagcggcacg 
atgcagatcc ctttcccagt ctgcctggag 
tqqqtggggg tcgtccccag aggccccgat 
cggttttEcg aggagtgctt atcctccctg 
gtgccctatg ggctcctgat cttcttccct 
ttctggcggg cccgcgactt ggccaggaag 
cccaggagca aaggcagctt ctccgagacc 
cctgggtcca ccggcgccac cttcctggcg 
gacttctcag acacgaatgg ccgtggtgtg 
itggaccccc gggttgtcct caagatgcag 
qctgggggcc agttcctctc tgggcaggag 
aaccaggcca tcgggcgagt gatccggcac 
gaccacaggt tcgcctttgc cgacgcaaga 
qtcagggtgt atgacaactt tggccatgtc 
qccgagcgaa ctatgccagc gccggccccc 
qatgctgtca gcgaggccaa gtcgcctggc 
ctqqacctgc atgtccceag cctgaagcag 
cccgagagta gcctgtgtgt ggagtatgag 
agggggctgc tggccgccct ggagcacagc 
clggcccaca gctgctccac cctgtccctc 
cgaggaggga ggaagaagat ccggctggtc 
clgacggaca gggccaagct cttcatggtg 
tttgccacct tcacccaggc cctgcaggac 
gccgcctgtc tcggccccct ctttgctgag 
ttctaccagt ttgtgcggcc ccaccataag 
acaggacgag gctgtggcta tcggcctgag 
ccggtcctgg accccactgg aagaacggcg 
qcagcccagc agctggaccc ccaagagcac 
agqccacccc caacaggaga ccctggcagc 
gcagggaagc agggccagca cgccgtgagc 
qggtccgcgg gctgtagcca actcttggca 
ctcgaciagg tgctggctgt Qttggccgcc 
ctgctgcaca ggttcagcat gtttgtgcgt 
tqcacagacc tgaccggccg gccctacccg 
aggcttgccg tgcctcctgt gcttacccac 
qagaagaccg ggaagaccca gagcaagatc 
actgtggggg cgggcggtga ggatgcaggt 
cctgcagcat ctgagtgggg cctctaggat 
gcagagcgtc atgcaggtct tctggccaga 
cacacccaac gtggcttgat " cct 9"tg 
acagaatagg ccagcccatg ccagccggct 
gtgagcagtc 
ctxgcatcgc 
agacccgaat 
cccaagatag 
caggagtaca 
agccctacgg 
caectccgag 
ccggatcggg 
tgctacacgg 
gtcatcaacg 
gagcagctgt 
ttgtgccgta 
agcctggagc 
agcaagcaca 
atattcatgc 
ctgaagggga 
tcggcatcct 
gtgctggagg 
gactccccca 
ctgctgcgcc 
accaagccag 
aagggctgca 
gtgttcacca 
gtggacccct 
aaggtgcaca 
agcaccactg 
cacagcatgc 
ctggccccgg 
aacccacaca 
ggagcccagt 
gggaaggctc 
tcctatcctg 
atggaggcgc 
atcagtgctt 
gtctgccggg 
attgtcacgg 
ttcctggatg 
tggtaccggc 
cgccaggact 
gcccaactgc 
atccgagacg 
cgggctacag 
cccttcttct 
aggtcctcag 
caggagccag 
gaacagcggg 
ctgtctgaga 
agccacccgg 
gccgtgaagc 
tacaagggtt 
gaccccaaga 
cagcagtttg 
cacagcattc 
ccggatccca 
ctgaaccagg 
caaccacagt 
gcctacctgg 
gcgctgacag 
ctgaccactg 
ccacaccaca 
ggcatggagc 
agggctcccc 
tcgtccttcc 
cccagccagt 
gtgcccagcc 